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and the written system of English in order to intervene strategically to assist 
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reading. As part of this the DES commissioned a team at NFER to conduct 
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who saw that there was a gap in the literature, not because there were no 
relevant books - see those by Albrow, Gimson (early editions), Venezky and 
Wijk in the references - but because they were all far too technical. He was 
then a commissioning editor of books on language and literacy, and invited 
me to Leeds to discuss what a book ‘dealing with the complex relationships 
between the orthography and the phonology of English’ should contain, and 
who might write it. We agreed that it should cover comprehensively both 
grapheme-phoneme and phoneme-grapheme relationships, and provide 
information of use to teachers. | was too busy to write it, but Roger already 
had John Mountford in mind. John’s book (Mountford, 1998) took a while to 
appear, and even in early drafts was clearly rather different from the book 
Roger and | had envisaged. So | began to work in earnest on my own. 

The need for ‘a large computer and a lot of programming’ that Tom 
Gorman had once warned me about was obviated by the work of others 
who had those resources: first for frequencies of phoneme-grapheme 
correspondences by Carney (1994), and later for frequencies of grapheme- 
phoneme correspondences by Gontijo et a/. (2003). (Carney’s book also 
contained grapheme-phoneme frequencies, but in a form unsuited to 
my purposes.) However, the more | delved into the intricacies of English 
spelling, the larger my comprehensive lists and analyses became - hence 
the size of this book. 
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Along the way a teacher-friendlier spin-off was possible. When Laura 
Huxford and Jenny Chew were drafting the Letters and Sounds materials 
(DfES, 2007), they consulted me on details of correspondences (both 
directions), and | provided them with handy phonics-friendly tables of the 
main ones which (slightly modified) appeared in Letters and Sounds. In 
2008 | also provided them to Roger Beard, who was then chairing a panel 
evaluating phonics schemes. He has commented (personal communication, 
2013) that they ‘proved to be succinct, easily accessible and linguistically 
accurate’. Further versions appear in Appendix B here. 

However, the battle to convince policy-makers and others of the need for 
teachers to understand the phonetic underpinnings of phonics has yet to be 
won. When | was on the Rose Committee, Maxine Burton and | submitted 
a paper (Burton and Brooks, 2005) to the committee putting the case for 
using the International Phonetic Alphabet (IPA) in teaching teachers about 
phonics, but this was ignored, as was my attempt to convince Laura and 
Jenny to use IPA symbols (rather than those of their own devising) in Letters 
and Sounds. | have argued that case into an apparent wilderness again 
(Brooks, 2007, 2011), but Maxine’s book Phonetics for Phonics (Burton, 
2011), which also contains the eight tables of correspondences included 
here in Appendix B, appears to be having some impact. We will fight on, and 
| hope the uncompromising use of IPA in this book will support that. 


How to use this book 


To find an explanation of International Phonetic Alphabet (IPA) symbols, see 
chapter 2. 
To look up the various ways a consonant phoneme is spelt, find its entry in 
chapter 3. 
To look up the various ways a vowel phoneme is spelt, find its entry in 
chapter 5. 
To look up the various ways a grapheme beginning with a consonant letter 
is pronounced, find its entry in chapter 9. 
To look up the various ways a grapheme beginning with a vowel letter is 
pronounced, find its entry in chapter 10. 
To find full lists and numbers of graphemes and phoneme-grapheme 
correspondences, see chapter 8. 
To find lists of the major grapheme-phoneme correspondences, see Table 
9.4 in chapter 9 and Table 10.1 in chapter 10. 
For teacher-friendlier lists of both kinds of correspondence, see Appendix B. 
Rules and hints for writing consonant letters double are in chapter 4. 
Some spelling rules for vowel phonemes are in chapter 6. 
Chapter 11 evaluates a few pronunciation rules for vowel graphemes. 
For discussion of assumptions and technicalities see Appendix A. 
To find discussions of individual words, search for them in the online 
version, as follows: 
Find the book at http://www.openbookpublishers.com/product/325 
Click on ‘READ THE HTML’. 
In the ‘Search this book’ box enter the word you’re interested in. 
If this fails to produce even one Google search result, sorry, the word 
isn’t in the book. 
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But if a Google search result does come up, click on it to bring up 
the relevant chapter, then press Control+F, enter the word you’re 
interested in and press Return. Happy browsing! 
Caveat emptor: Here are some things this book is not about (see also the 
penultimate paragraph of section 1.2): 
It has very little to say about the teaching of spelling - for a handy 
online guide to that and to the anlaysis of spelling errors, see http: // 
www.meshguides.org/spelling/ 
It does not attempt to teach the technicalities of phonetics or 
phonology - for those see Cruttenden (2014) and Roach (2009); 
Because the range of accents with which English is spoken is so vast, 
attempting to relate English spelling to all of them would produce an 
encyclopedia, hence this book focuses solely on the British Received 
Pronunciation accent (and British spelling). Should | live long enough, 
| may try to produce a parallel volume on the General American accent 
(and US spelling); 
It does not attempt to relate the description of the spelling system 
to psycholinguistic theories about the processes involved in reading 
and spelling (e.g. ‘dual-route theory’) - for all of that | recommend 
Snowling and Hulme (2005); 
It does not tackle in any detail the question of how to tell from the 
written forms of English words which of their spoken counterparts’ 
syllables are stressed and which are not - but for some reflections on 
this see section A.10 in Appendix A. 
To hear the phonemes of English pronounced in context with an RP accent, 
try this British Library website: http: //www.bl.uk/learning/langlit/sounds/ 
case-studies/received-pronunciation 
For very user- and teacher-friendly guidance on spelling, Jazzy Spelling 
Secrets: Teacher’s Toolkit, shortly to be published by Jaz Ampaw-Farr of 
Which Phonics Ltd, sign up to her website: http: //whichphonics.co.uk 


1. Introduction 


1.1 Context 


English spelling is notoriously complicated and difficult to learn, and is 
correctly described as much less regular and predictable than any other 
alphabetic orthography. The 40+ distinctive speech sounds (phonemes) of 
the spoken language are represented by a multiplicity of letters and letter- 
combinations (graphemes) in the written language; correspondingly, many 
graphemes have more than one pronunciation. This is because English has 
absorbed words from many other languages (especially French, Latin and 
classical Greek) into its Germanic base, and mainly taken over spellings 
or transliterations of those words without adapting them to the original 
system. Two recent books (Crystal, 2012; Upward and Davidson, 2011) tell 
this story with wit and insight. 

However, there is more regularity in the English spelling system than 
is generally appreciated. This book, based on a very detailed analysis of 
the relationships between the phonemes and graphemes of British English, 
provides a thorough account of the whole complex system. It does so by 
describing how phonemes relate to graphemes, and vice versa. It is intended 
to be an authoritative reference guide for all those with a professional interest 
in English spelling, including and especially those who devise materials for 
teaching it, whatever their students’ age and whether their own or their 
students’ mother tongue is English or not. It may be particularly useful 
to those wishing to produce well-designed materials for teaching initial 
literacy via phonics (for guidance on the phonetics which should underpin 
accurate phonics teaching see Burton, 2011), or for teaching English as a 
foreign or second language, and to teacher trainers. 
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The book is intended mainly as a work of reference rather than theory. 
However, all works of reference are based on some theory or other, whether 
or not explicitly stated for readers, and even if not consciously known to 
the writer. For the assumptions | have made and for discussion of technical 
issues, see Appendix A. 


1.2 Aims 


My aims are to set out: 

1) the distinctive speech sounds (phonemes) of spoken English 

2) the letters and letter-combinations (graphemes, spelling choices) of 
written English 

3) how the phonemes of spoken English relate to the graphemes of written 
English 

4) the mirror-image of that, that is, how the graphemes of written English 
relate to the phonemes of spoken English 

5) some guidance on the patterning of those relationships. 

The core of the book is the chapters in which | set out the relationships 

(correspondences) between phonemes and graphemes: 


Phoneme-grapheme Grapheme-phoneme 
correspondences correspondences 
Consonants | Chapter 3 Chapter 9 
Vowels Chapter 5 Chapter 10 


Although chapters 9 and 10 are concerned with howthe graphemes of English 
are pronounced, those seeking guidance on how to pronounce (including 
where to stress) whole English words, given only their written form, should 
instead consult a pronouncing dictionary in which the International Phonetic 
Alphabet is used, e.g. the Cambridge English Pronouncing Dictionary, 18th 
Edition (Cambridge: Cambridge University Press, 2011). The phonetic 
transcription system used in this book (see chapter 2) is identical to the 
system used in that dictionary. A useful guide for those who are uncertain 
whether, for example, an English word beginning with a ‘yuh’-sound begins 
with the letter <y> or the letter <u> is the ACE (Aurally Coded English) 
Spelling Dictionary by David Moseley (1998). 

| make only a few suggestions in this book about how to teach English 
spelling - my aim is mainly to set out my analysis of the system. However, 


Introduction 3 


some findings may have pedagogical applications - see especially chapter 
11, section A.7 in Appendix A, and Appendix B. | also make no attempt to 
justify English spelling or summarise its history (see again Crystal, 2012; 
Upward and Davidson, 2011), and make only a very few remarks on changes 
that might be helpful. For a few other things which this book does not 
attempt to do see p.xii. 

My analysis is confined to the main vocabulary of English - | almost 
entirely omit the extra complexities of spellings which occur only in personal, 
place- and brand-names (though | mention a few where they parallel rare 
spellings in ordinary words; see also section A.5 in Appendix A), archaic or 
obsolete words, words which occur in non-standard dialects of English but 
not in Standard English, culinary terms with spelling patterns which occur 
in no other word, words known only to Scrabble addicts, and new spellings 
in text messaging. And there are intricacies which | have glossed over or 
passed over in silence - if you want to go further consult one or more of the 
books listed in the references. 


1.3 Some terminology 


Some familiarity with linguistic and grammatical terminology is assumed, 
e.g. ‘indefinite article’, ‘noun’, ‘adjective’, ‘verb’, ‘adverb’, ‘content word’, 
‘function word’, ‘singular’, ‘plural’, ‘third person’, ‘present’, ‘past’, ‘tense’, 
‘participle’, ‘possessive’, ‘bound forms’, ‘affix’, ‘prefix’, ‘suffix’, ‘syllable’, 
‘penultimate’, ‘antepenultimate’. Some terms, however, are used in different 
senses by different writers and/or are less familiar - most of those | find 
indispensable are explained in the remaining sections of this chapter (and 
various others in sections 2.3, 2.5, 3.6, 5.5.3, 6.4-6, 6.10, 7.1, 7.2). 
Throughout the book, 
| refer to the distinctive speech sounds of spoken English as ‘phonemes’ 
and show them between forward slashes; for example, /b/ is the first 
phoneme in the word bad; 
| refer to the spelling choices of written English as ‘graphemes’ and 
show them between angled brackets; for example, <p> is the first 
grapheme in the word pad; and 
| refer to the relationships between phonemes and graphemes (in both 
directions) as ‘correspondences’. 
An asterisk before a word indicates that the word is misspelt, e.g. 
*accomodation, *hastle, *occured. 
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1.4 Phonemes 


Phonemes are distinctive speech sounds, that is, they make a difference to 
the meanings of words. For example, the difference between /b/ and /p/ 
makes the difference in meaning between bad and pad. (There is of course 
much more to this - for some discussion, see Appendix A, section A.2). 

In English, phonemes fall into two main categories, consonants and 
vowels. These terms may well be familiar to you as categories of letters, 
but the very familiarity of these labels for letters may cause confusion when 
thinking about phonemes. For one thing, there are many more phonemes 
in spoken English (44 or thereabouts) than there are letters in the English 
version of the Roman alphabet (26). For another, some graphemes are 
used to represent both consonant and vowel phonemes - the most familiar 
example being the letter <y>. 

To phoneticians, the difference between consonant and vowel phonemes 
is that consonants require some obstruction of the airflow between lungs 
and lips, whereas vowels do not. For technical details on this see Peter 
Roach (2009) English Phonetics and Phonology, Fourth edition, Cambridge: 
Cambridge University Press. However, for practical purposes a test for 
distinguishing between consonant and vowel phonemes which works 
for English is that the indefinite article, when immediately followed by a 
word which begins with a consonant phoneme, takes its a form, but when 
immediately followed by a word which begins with a vowel phoneme takes 
its an form. So hand, union and one-off begin with consonant phonemes, 
but hour, umbrella and on-off begin with vowel phonemes. 

Vowel phonemes can consist of one or two sounds. Those which consist 
of one sound are pure vowels, and those which consist of two sounds are 
diphthongs. When you pronounce a pure vowel, your jaw, lips, etc., remain 
relatively stationary; when you pronounce a diphthong, they move. Try 
saying the words awe (which consists in speech of one pure vowel) and then 
owe (which consists in speech of a diphthong), and feel the difference. 

(For long and short vowels see sections 1.5 and 2.4). 

In contrast, most consonant phonemes consist of only one sound, though 
they can of course occur in clusters, for example at the beginning and end 
of strengths. The only consonant phonemes in English which consist of 
two sounds are those at the beginning of chew and jaw - see the complex 
symbols for these phonemes in Table 2.1 in chapter 2. 

(For consonant clusters and blends see section 1.7). 
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1.5 Long and short vowels 


To many teachers, a short vowel is a sound related within the teaching 
approach known as phonics to one of the letters <a, e, i, 0, u>, anda long 
vowel is a different sound related within phonics to one of the same five 
letters. In this book the terms ‘short vowel’ and ‘long vowel’ are not used in 
this way, but in the senses they have in phonetics. To phoneticians, a short 
vowel is a phoneme that takes only a few milliseconds to pronounce, and 
a long vowel is a phoneme that takes rather longer to pronounce. Both are 
pure vowels in the sense defined in section 1.4, and both categories are 
listed and exemplified in section 2.4, where it is shown that the English 
accent on which this book is based has seven short pure vowel phonemes 
and five long pure vowel phonemes. 

Five of the short pure vowels are indeed the sounds associated with the 
letters <a, e, i, 0, u> in phonics teaching, but there are two more short 
vowels in the phonetic sense: the sound represented by letter <u> in put, 
and the sound represented by letter <a> in about. And of the five so-called 
long vowels associated with the letters <a, e, i, 0, u> in phonics teaching, 
only the name of letter <e> is along pure vowel in the phonetic sense; three 
are diphthongs (the names of <a, i, o>), and the name of <u> is a sequence 
of two phonemes, the sound of letter <y> when it begins a word followed 
by the sound of the exclamation ‘Oo!’. 

However, the sounds which are the names of the letters <a, e, i, 0, u>, 
plus the phoneme whose sound is ‘oo’ (phonetic symbol /ur/), do have 
some useful spelling properties as a set. | make use of this fact in chapters 
5 and 6, where you will find them grouped together as the ‘letter-name 
vowels plus /uz/’. See also section 1.10 below. 


1.6 Graphemes 


| define graphemes as single letters or letter-combinations that represent 
phonemes. (Again, there is more to it than this - for some discussion, see 
Appendix A, section A.4). 

Graphemes come in various sizes, from one to four letters. | call 
graphemes consisting of one, two and three letters ‘single-letter graphemes’, 
‘digraphs’ and ‘trigraphs’ respectively. Where it is necessary to mention 
four-letter graphemes (of which there are 19, in my analysis - see Tables 
8.1-2), for example <ough> representing a single phoneme as in through, 
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| call them ‘four-letter graphemes’ (and not ‘tetragraphs’ or ‘quadgraphs’). 
Graphemes of all four sizes are used in English to spell both consonant and 
vowel phonemes. 


1.7 Consonant clusters and ‘blends’ 


As already illustrated with the word strengths, consonant phonemes 
(and letters) can occur in groups. Many teachers use the term ‘blend’ for 
such groups, but | have observed that it is often used to cover not only 
groups of consonant phonemes or letters, but also digraphs and trigraphs 
representing single consonant phonemes - which can and does create two 
sources of confusion. First, using ‘blend’ in this way means that letters and 
sounds are being muddled up; it is a central tenet of my approach that 
graphemes and phonemes must be carefully distinguished. 

Secondly, it encourages some teachers to think that ‘blends’ need to 
be taught as units, rather than as sequences of letters and phonemes. For 
example, it makes more sense to teach <bl> at the beginning of the word 
blend itself as two units, <b> pronounced /b/ and <I> pronounced /I/ 
(segmentation, in the terminology of synthetic phonics), and then merge 
them into /bl/ (blending (!), again in the terminology of synthetic phonics, 
where this term is entirely appropriate). For both analytical and teaching 
purposes the two categories of clusters and multi-letter graphemes are best 
kept apart. | therefore stick with the term ‘clusters’ for groups of consonant 
phonemes or letters, and avoid the term ‘blend’ completely. 


1.8 Split digraphs and ‘magic <e>’ 


In English spelling there are six digraphs which are not written continuously 
but are interrupted by a consonant letter (or occasionally two consonant 
letters or a consonant letter plus <u>). These digraphs have one of the 
letters <a, e, i, 0, u, y> as the first letter and <e> as the second letter, and 
in most cases that <e> marks the first vowel letter as having what teachers 
call its ‘long’ sound (if we accept, which never seems to be pointed out, that 
the ‘long’ sound of <y> when used as a vowel letter is the same as that of 
<i>). For example, in bite the ‘eye’ sound is represented by the letters <i, e> 
even though they are separated by the <t>. | call digraphs which consist of 
two separated letters ‘split digraphs’. To symbolise split digraphs | write the 
two relevant letters with a dot between them; for example, the split digraph 
representing the ‘eye’ sound in bite is written as <i.e>. In my analysis, the 
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full set of split digraphs is <a.e, e.e, i.e, 0.e, u.e, y.e>. | have not found 
it necessary to posit more complicated graphemes such as <ae.e> (‘split 
trigraphs’?) - see section A.6 in Appendix A. 

Split digraphs occur only towards the end of written stem words. They 
have no place in conventional alphabetical order, so when | need to include 
them in alphabetical lists, | place them immediately after the digraph 
consisting of the same two letters but not split, for example <a.e> comes 
after <ae> (or sometimes where the unsplit digraph would be, if it happens 
not to be needed in a particular list). 

(I also posit four graphemes containing apostrophes: <e’er, e’re, ey’re, 
ou’re> - these | place in lists as if the apostrophe were a 27th letter of the 
alphabet. See section A.9 in Appendix A). 

Many teachers refer to the split digraph use of <e> as ‘magic <e>’. 
While this seems perfectly valid pedagogically (and | use the expression 
occasionally in this book), | mostly use the term ‘split digraph’ because not 
all occurrences of the split digraphs contain ‘magic <e>’ in the sense that 
the other vowel letter has its usual ‘long’ pronunciation. (See the entries for 
<a.e, e.e, i.e, 0.e, u.e, y.e> in chapter 10, sections 10.4/17/24/28/38/40). 

For a more technical discussion of split digraphs see section A.6 in 
Appendix A, and for a pedagogical discussion of ‘magic <e>’ rules see 
section 11.4. 


1.9 Stem words and derived forms 


Stem words are those which are indivisible into parts which still have 
independent meaning; derived forms are all other words, i.e. those which 
contain either a stem word and one or more prefixes or suffixes, and/or two 
(or more) stem words combined into a compound word. This book is mainly 
concerned with stem words, but some sections apply specifically to derived 
forms (e.g. section 4.2 on the rule for doubling stem-final consonant letters 
before suffixes beginning with a vowel letter). | try throughout to indicate 
where rules or correspondences differ between stem words and derived 
forms, sometimes in separate lists, sometimes by using brackets round 
prefixes and suffixes; and | often refer to derived forms as ‘derivatives’. 


1.10 Positions within words 


Many correspondences are specific to particular positions in words, some 
to the beginnings of words (‘word-initial position’), some to the middle of 
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words (‘medial position’), some to the ends of words (‘word-final position’). 
In chapters 3-7, that is, all the chapters concerned with the sound-to- 
symbol direction, | have tried to be consistent in using ‘initial’, ‘medial’ and 
‘final’ only in terms of phonemes (or, where specifically indicated, syllables 
- see third and fourth paragraphs below). For example, the phoneme /j/ (the 
sound of letter <y> at the beginning of a word) is in word-initial position 
in both yell and union. 

Word-final position applies to consonant phonemes even when the letters 
representing them occur within split digraphs, e.g. the /t/ phoneme in bite 
is in word-final position even though the letter <t> is not. Correspondingly, 
vowel phonemes and diphthongs spelt by the split digraphs are never 
word-final - as I’ve just implied with the example of bite, there is always a 
consonant phoneme after the vowel phoneme or diphthong, even though 
the letter <e> is at the end of the written word. In section 5.5.3 (only) | also 
refer to ‘pre-final’ position, that is, the phoneme immediately preceding the 
last phoneme in a word. 

In chapters 3-7 | frequently use the term ‘word-final position’ to mean 
the end of stem words. For instance, when | say that the grapheme <sh> is 
the regular spelling of the ‘sh’ phoneme in word-final position | include its 
occurrences in both fish and fishing. Even more generally, when | say that a 
particular correspondence occurs in a stem word, this also applies to words 
derived from it, unless otherwise stated. 

Other correspondences are specific to particular syllables in words; some 
are specific to monosyllabic words and the final syllables of polysyllabic 
words - | call these collectively ‘final syllables’ - and others to syllables 
before the last one in words of more than one syllable (‘non-final syllables’). 
In sections 10.27 and 10.36 | also distinguish between penultimate and 
antepenultimate syllables, that is, those immediately before the final 
syllable and immediately before that in words with enough syllables; and 
in section 10.42 antepenultimate syllables reappear, along with the fourth 
syllable from the end of a word. 

The largest set of exceptions to analysing phoneme-grapheme 
correspondences according to intial, medial and stem-final phonemic 
positions within words relates to the letter-name vowels, plus /ur/. As will 
be shown in section 5.1, these need instead to be analysed according to 
final v. non-final syllables. 

Some authors use the terms ‘polysyllables’ and ‘polysyllabic words’ 
to refer to words of three or more syllables, and therefore distinguish 
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systematically between monosyllables, disyllables (two-syllable words) and 
polysyllables. However, in my analysis | have mainly found it unnecessary 
to distinguish between disyllables and longer words, and therefore use the 
terms ‘polysyllables’ and ‘polysyllabic words’ to refer to words of two or 
more syllables. On the few occasions when a process operates specifically 
in words of two syllables (see especially the second part of the main 
consonant-doubling rule, section 4.2) | refer to them as two-syllable words, 
and similarly for longer words. 

In chapters 9 and 10, which deal with the grapheme-phoneme direction, 
the meanings of ‘initial’, ‘medial’ and ‘final’ referring to positions in words 
necessarily change: there they refer to positions in written words. So, for 
instance, there the ‘magic <e>’ in split digraphs is described as being in 
word-final position, and consonant letters enclosed within split digraphs 
are in medial position. 


1.11 Open and closed syllables 


Many vowel correspondences differ between open and closed syllables. 
Open syllables end in a vowel phoneme; closed syllables end in a consonant 
phoneme. The distinction is clearest in monosyllabic words; for example, go 
is an open syllable, goat is a closed syllable. 

For more on syllables in general, see section A.3 in Appendix A. 


1.12 ‘2-phoneme graphemes’ 


In English spelling, the letter <x> frequently spells /ks/, which is a sequence 
of two phonemes, /k/ and /s/. An example is the word box. So when <x> 
spells /ks/ | call it a ‘2-phoneme grapheme’. (Carney, 1994: 107-8 has 
a rather different approach to ‘two-phoneme strings’.) My analysis has 
uncovered 36 of these in all (see Tables 8.1-2). 

When dealing with phoneme-grapheme correspondences in chapters 3 
and 5, | mention each 2-phoneme grapheme in two places, one for each 
of the phonemes it spells. For example, you will find <x> spelling /ks/ 
under both /k/ and /s/ (sections 3.7.1, 3.7.6). However, in chapters 9 and 
10, which deal with grapheme-phoneme correspondences, each multi- 
phoneme grapheme is mentioned in only one place, under its leading letter. 

One of the 2-phoneme graphemes - <u> spelling /ju:x/ (the sound of 
the whole words ewe, yew, you and the name of the letter <u>) - is so 
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frequent that | have infringed my otherwise strictly phonemic analysis to 
accord the 2-phoneme sequence /ju:/ special status as a quasi-phoneme 
that is important enough to have its own entry - see Table 2.2 and section 
5.7.5 - as does Carney (1994: 200-2). 

Two of the 2-phoneme graphemes also function as 3-phoneme 
graphemes: <x> spelling /eks/ in X(-ray), etc., and <oir> spelling /wata/ 
(the pronunciation of the whole word wire) only in choir. Logically, therefore, 
each of these is dealt with in several places in chapters 3 and 5 (but in 
chapters 9 and 10, only once, under <x> and <o> respectively). 

For what | have called ‘2-phoneme graphemes’ Haas (1970: 49, 70) 
suggested the term ‘diphone’, to parallel ‘digraph’ - but it never caught on 
(though ‘diphone’ is used in phonetics to mean a sequence of two sounds 
or the transition between them). If it had caught on, my identification of 
3-phoneme graphemes would logically have required coining ‘triphone’ 
(which also exists in phonetics and means ‘a sequence of three phonemes’). 
| have stuck with my terminology. 


1.13 ‘Regular’ correspondences 


| refer to many correspondences as ‘regular’. This does not mean that they 
apply always and without exception. Very few spelling correspondences in 
English have no exceptions. (At least in the main system - many minor 
correspondences have no exceptions, but are very restricted in scope. One 
example is the grapheme <aigh>, which is always pronounced like the 
name of letter <a> - but since it occurs only in the word straight, this is not 
much help.) So | use the word ‘regular’ to mean ‘predominant’, the major 
tendency. 

Where lesser generalisations are possible | try to state only those that 
are helpful. For instance, Carney (1994: 185), in the course of analysing the 
correspondences of the vowel phoneme /2:/ (the sound of the word awe) 
shows that only spellings with <or> occur before four particular consonants 
or consonant clusters, and that spellings with <or> never occur before six 
others. But these generalisations only cover just over 30 words, so | have 
ignored them. For a contrast, see Table 3.5, where | organise spellings of 
word-final /s/ as in hiss into 11 subcategories - justified by the very large 
number of words with this final consonant phoneme and relatively small 
amounts of overlap between the subcategories. 
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Also, some words which seem quite irregular in the phoneme-grapheme 
(spelling) direction are less so in the grapheme-phoneme (reading aloud) 
direction, for example ocean. This is partly irregular in the phoneme- 
grapheme direction: every other word which ends in the sound of the word 
ocean is spelt <-otion>, so in ocean the spellings of the ‘sh’ phoneme as 
<ce> and the following schwa vowel (see chapter 2) as <a> are unusual in 
this context. However, in the grapheme-phoneme direction ocean is entirely 
regular: all words ending in <-cean> have the stress on the preceding vowel, 
which has its letter-name pronunciation, ‘Oh’ in this case, and the <-cean> 
ending, though rare, is always pronounced roughly like the word shun. 

On the other hand, when | speak of ‘regular verbs’ the word ‘regular’ 
has its usual sense - these are the verbs (the great majority) that form both 
past tense and past participle (in writing) by adding <-ed> (see sections 
3.5.2, 3.5.7, 5.4.3 and 10.15 for the phonetic equivalents). Some oddities 
can be noted here: the past tense and participle forms of the verbs lay, pay 
are pronounced regularly as /lerd, petd/ but are spelt irregularly: laid, paid 
(regular spellings would be */ayed, *payed - which do appear occasionally 
- see sections 3.7.1, 5.7.1 and 6.5). Similarly, regular spellings of the 
adverbs daily (also an adjective), gaily would be *dayly, *gayly (see again 
section 6.5). Conversely, there is one plural noun with a regular spelling 
but an irregular pronunciation: houses, which is pronounced /‘hauziz/ 
with irregular change of the stem-final consonant from /s/ to /z/ (if its 
pronunciation were regular it would be /'hausiz/ ‘haussiz’). 

But those quirks are tiny compared to the overall irregularities in the 
relationships between pronunciation and spelling. For many languages 
the complete set of both phoneme-grapheme and grapheme-phoneme 
correspondences could be set out on one page. The complexities of English 
spelling, especially of vowels, which entail that this book is so large are 
a measure of the task facing learners who wish to write correctly-spelt 
English and (try to) derive accurate pronunciations of English words from 
their written forms. 


2. The phonemes of 
spoken English 


2.1 Choosing an accent to analyse 


English is spoken with many accents, and the number of phonemes, and 
the exact sounds of many of them, vary across accents. In order to list 
phonemes, therefore, | first had to choose an accent to base my list on. 
Because this book deals with British spelling, the accent | have chosen 
is the British accent known to many linguists as ‘Received Pronunciation’ 
(RP). Recently some linguists have re-named it ‘Southern British Standard’ 
(SBS), or ‘Standard Southern British’, or even ‘General British’ (Cruttenden, 
2014), but | have retained the term RP because it is more widely known. 
The French textbook of English from which | learnt phonetic transcription 
(see below) in 1963 called it the accent ‘des milieux cultivés du sud-est 
anglais’, but that was too narrow a definition; though it is particularly 
prevalent in educated circles in the South-East of England, people from 
all over Britain have this accent, and their regional origins are therefore 
difficult to deduce from their accent. 
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2.2 How many phonemes? 


In RP there are 44 phonemes. Of these, 24 are consonant phonemes, 
and 20 are vowel phonemes. 

From the fact that there are many more phonemes in RP than the 26 
letters of the English alphabet, it is fairly clear that some phonemes have 
no predominant one-letter spelling. But for the purposes of this book a 
single way of representing each phoneme is needed. To do this, | use the 
symbols of the International Phonetic Alphabet (IPA). You will need to learn 
to read this system fluently in order to be able to use the rest of this book. 
Many words in this book are written in IPA alongside the conventional 
spelling, so that you can do some incidental learning as you read. The 
symbols for consonant phonemes are easier to learn (because most are 
ordinary letters, though some have unfamiliar values), so | start with them. 

For some purposes it is important to distinguish between voiceless 
consonant phonemes - those pronounced without vibrating the vocal 
cords - and voiced consonant phonemes - the rest. Those which are 
voiceless are so labelled in Table 2.1, and various sub-systems which rely 
on this distinction are discussed under /d, t/ in sections 3.5.2 and 3.5.7, 
under /1/ in section 5.4.3, and also in sections 3.7.8, 5.7.2 and 7.2.3. 

There is little difference in the number or pronunciation of the 
consonant phonemes across much of the English-speaking world, and 
much less variation than in the vowel phonemes - in fact, differences in 
vowel phonemes almost entirely define the differences between accents. 
However, two consonant phonemes which do not occur in RP (and are 
therefore not counted in my analyses of correspondences) but do occur in 
many Scots accents are mentioned in a few places: 

the voiceless counterpart of /w/, which is usually spelt <wh>, sounds 
roughly like ‘hw’, and is symbolised /m/; examples would be which, 
when 

the throat-clearing sound which is spelt <ch> in some Scottish words, 
e.g. dreich, loch, Sassenach, and German names like Schumacher (or 
<gh> in some Irish words, e.g. lough, or <kh> in transcriptions of 
some Russian names, e.g. Mikhail, and is symbolised /x/ - on no 
account to be confused with letter <x>, but | have not included this 
correspondence in my analysis because /x/ is not a phoneme of RP. 
See Notes to sections 9.9/15/19 and 10.33. 
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2.3 The consonant phonemes of Received 
Pronunciation 


Table 2.1 presents the IPA symbols for the 24 consonant phonemes of RP. 


TABLE 2.1: THE INTERNATIONAL PHONETIC ALPHABET SYMBOLS FOR THE 24 


CONSONANT PHONEMES OF THE RECEIVED PRONUNCIATION ACCENT OF ENGLISH. 


Consonant phonemes with doubled spellings’ which are rare in one-syllable words 


one-syllable words after a short vowel phoneme spelt with one letter 


/b/ as in the first sound of by /bat/ 

/d/ as in the first sound of dye /dat/ 

/g/ as in the first sound of goo /gu:/ 

/m/ as in the first sound of my /mat/ 

/n/ as in the first sound of nigh /nat/ 

/p/ as in the first sound of pie /pat/ voiceless 
/t/ as in the first sound of tie /tar/ voiceless 
/r/ as in the first sound of rye /rat/ 

Consonant phonemes with doubled spellings” which are regular at the end of 


/k/ as in the first sound of coo /ku:/ voiceless 
/tf/ as in the first sound of chew /tfur/ voiceless 
/f/ as in the first sound of few /fju:/ 

/c3/ as in the first sound of jaw /d3>:/ 

/\/ as in the first sound of law /lox/ 

/s/ as in the first sound of sue /sux/ voiceless 
/v/ as in the first sound of view /vjux/ 

/z/ as in the first sound of Zoo /zu:/ 

Consonant phonemes without doubled spellings 

/h/ as in the first sound of who /hu:/ 

/4/ as in the last sound of ring /rtq/ 

/S/ as in the third sound of fission /'ftfan/ voiceless 
/3/ as in the third sound of vision /'vizan/ 

/8/ as in the first sound of thigh /8ar/ voiceless 
/d/ as in the first sound of thy /Oat/ 

/w/ as in the first sound of well /wel/ 

/j/ as in the first sound of yell, union /jel, ‘ju:njan/ 


* For doubled spellings see section 3.2 and much of chapter 4. 
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2.4 The vowel phonemes of Received 
Pronunciation 


Table 2.2 presents the IPA symbols for the 20 vowel phonemes of RP and, as 
mentioned in section 1.12, the 2-phoneme sequence /jur/. 


TABLE 2.2: THE INTERNATIONAL PHONETIC ALPHABET SYMBOLS FOR THE 20 VOWEL 
PHONEMES OF THE RECEIVED PRONUNCIATION ACCENT OF ENGLISH, PLUS /ju:/. 


Short pure vowels 

/xe/ as in the first sound of ant /ent/ 
/e/ as in the first sound of end /end/ 
/1/ as in the first sound of ink /1nk/ 
/o/ as in the first sound of Ox /oks/ 
/A/ as in the first sound of up /Ap/ 

/v/ as in the second sound of pull /pul/ 
/a/ (schwa) | as in the first sound of about /2’baut 
Long pure vowels 

/ax/ as in the first sound of aardvark /'a:dvark/ 
/3:/ as in the first sound of earl /3xI/ 
/o:/ as in the whole sound of awe /o:/ 
/ux/ as in the first sound of ooze /urz/ 
ie as in the first sound of eel /ixl/ 
Special 2-phoneme sequence 

/jux/* as in the first two sounds of union /‘juxnjan/ 
Diphthongs 

Jet/* as in the first sound of aim /erm/ 
Jat/* as in the first sound of ice /ats/ 
Jav/* as in the first sound of oath /3v8/ 
/au/ as in the first sound of ouch /avtf/ 
/o1/ as in the first sound of oyster /‘dIsta/ 
/ea/ as in the whole sound of air /ea/ 
/1a/ as in the whole sound of ear /12/ 
/ve/ as in the second sound of rural /‘ruaral/ 


* These four vowel phonemes and /ju:/ are the ‘letter-name’ vowels - see 


sections 5.1, 5.7, 6.2 and 6.3. Phoneme /u:/ also belongs with them. 
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The last short pure vowel listed in Table 2.2, the /a/ phoneme, is heard in 
the first syllable of about /a'baut/ and the second syllable of oyster /'s1sta/. 
It is the least distinctive phoneme in English - think how little effort is 
needed to say it. However, that does not mean it is unimportant, because it 
has three special characteristics: 
in RP it occurs only in unstressed syllables, and (almost) never 
in stressed syllables (except that RP-speakers now increasingly 
pronounce because as /b1'kaz/ rather than /b1'koz/). In occurring only 
in unstressed syllables /a/ is unique among English vowel phonemes 
(but note that this applies only to English - in many other languages 
/a/ occurs in both stressed and unstressed syllables); 
in (my analysis /version of) RP it is the only short vowel phoneme which 
occurs word-finally; 
it is the most frequent phoneme of all in spoken English, in every 
accent, because a high proportion of unstressed syllables contain it. In 
RP, for example, it constitutes about 10% of running speech. 
Also uniquely, this phoneme has a special name (derived from Hebrew): 
schwa, or the schwa vowel. 

As stated in section 1.2, the phonetic symbols used in this book are 
identical to those used in the 18th (2011) edition of the Cambridge English 
Pronouncing Dictionary. They are also identical to those used in most of 
the eight editions of Gimson’s Pronunciation of English, including the 7th. 
However, as this book was nearing publication, my attention was drawn 
to the fact that, in the latest (eighth) edition of Gimson’s Pronunciation of 
English (Cruttenden, 2014), Cruttenden has introduced two changes: 

for the ‘short a’ sound, listed above as ‘/z/ as in the first sound of 
ant’, he now uses plain /a/ (for the reasons for this see his page xvii); 
for /ea/ as in the whole sound of air he now uses /€:/, on the grounds 
that this phoneme, in the mouths of most speakers of ‘General British’ 
(= RP), is no longer a diphthong but a long pure vowel. 

One of the current editors of the Cambridge English Pronouncing 
Dictionary, Prof. Jane Setters of the University of Reading, kindly told me 
that, although she and her fellow editors are aware of these changes and 
use them in their teaching, they do not propose to introduce them into the 
Dictionary. Since | wish this this book to parallel the Dictionary | have not 
adopted them either. 
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2.5 Polysyllabic words and word stress 


In all English words of two or more syllables, one of the syllables is spoken 
with heavier emphasis than the rest. For example: 

oyster is stressed on the first syllable 

about is stressed on the second syllable. 
In IPA transcriptions the stressed syllable is marked with a small vertical 
notch placed in front of it: /'‘D1sta, a'baut/. Analysing and marking word 
stress is not just an exercise; many phoneme-grapheme and grapheme- 
phoneme correspondences apply only in words of more than one syllable, 
some only in stressed syllables, others only in unstressed syllables. The 
clearest example is the occurrence of /a/ only in unstressed syllables. 
Occasionally for simplicity | use an acute accent on the ordinary spelling of 
a word to indicate stress, e.g. arithmetic (noun), arithmetic (adjective). 

The question of predicting from the written form of polysyllabic words 

where the stress falls on them is attempted, and largely failed, in section 
A.10 in Appendix A. 


3. The phoneme-grapheme 
correspondences of 
English, 1: Consonants 


3.1 The general picture: the regular spellings 
of English consonant phonemes 


This chapter can be summed up by saying that 13 of the 24 consonant 
phonemes of RP have highly regular spellings (though for two of these, 
/w, 9/, positional constraints have to be stated), while the other 11 have to 
be analysed according to position in the word. 

So, the 11 consonant phonemes /bdghmnoprt@0/ are regularly spelt 
<bdghmnoprtthth> respectively; /w/ (which occurs only before vowel 
phonemes and therefore does not occur word-finally) is regularly spelt <w> 
initially, <u> medially (but see the note below Table 3.1); and /n/ (which 
occurs only after short vowel phonemes and therefore does not occur 
initially) is regularly spelt <n> before /k, g/, however spelt, otherwise <ng>. 

The main regularities for the other 11 consonant phonemes are 
summarised in Table 3.1, by position in the word. For seven phonemes final 
position has to be subdivided, and final /s, k/ have a further sub-subdivision. 
The entries for /dj, s, k/ blur the distinction between phonemes and 
graphemes in defining word positions - for more detail on these phonemes’ 
complicated correspondences, and for the 2-phoneme grapheme <x>, see 
sections 3.7.1, 3.7.4 and 3.7.6 and Tables 3.3 and 3.4. 
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TABLE 3.1: MAIN CORRESPONDENCES OF THE 11 CONSONANT PHONEMES WITH 
VARIABLE SPELLINGS, BY POSITION IN THE WORD. 


Position in word 
Phoneme Initial Medial Final 
/J/ sh ti sh 
/f/ f f ff 
/v/ Vv Vv ve 
/j/ within /jux/ See Table 5.1 
/j/ elsewhere y i* (does not occur) 
in monosyllables 
after a short vowel ’ 
. otherwise 
spelt with one 
letter 
/\/ | Il L 
ch 
/tf/ but <t> t tch ch 
before /u:/ 
/3/ (rare) si (does not occur) ge 
/Z/ z s ZZ iS 
<g> before 
. <e, i, y>, 
/c3/ j ‘ dge ge 
otherwise 
<j> 
in other 
monosyllables 
/s/ s s ss <ce>; in 
polysyllables 
<s> 
c Cc in other 
monosyllables 
/k/ ck <k>; in 
but <k> before <e, i, y> polysyllables 
<c> 


represented in the spelling at all - see sections 3.8.7-8. 


* N.B. Many occurrences of medial /j/ (and some of medial /w/) are actually not 
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3.2 Order of description 


In sections 3.5, 3.7 and 3.8 | set out the consonantal phoneme-grapheme 
correspondences of English, under the consonant phonemes listed in the 
order in which they appear in Table 2.1 in chapter 2. 

The consonant phonemes fall into two main categories, those which 
have a doubled spelling, such as <bb> for /b/ in rabbit, and those which 
do not. Within those which do have a doubled spelling there is a further 
important distinction, between those whose doubled spellings are rare in 
one-syllable words and those whose doubled spellings are regular at the 
end of one-syllable words after a short vowel phoneme spelt with one letter 
(Crystal, 2012, especially chapters 7 and 8, explains how this division goes 
back many centuries). Sections 3.5 and 3.7 cover these two categories of 
consonant phonemes with doubled spellings, and section 3.8 those which 
do not have a doubled spelling. 

This trichotomy (the Greek etymology of this word officially means 
‘cutting into three’, but unofficially could also mean ‘haircut’ - how neat 
is it that the word meaning ‘cutting into three’ could also mean ‘splitting 
hairs’?) does not quite accommodate /r/. It does have a doubled spelling 
(<rr>) and therefore does not belong in section 3.8 (phonemes without a 
doubled spelling). But /r/ does not occur word-finally in RP, so is not even 
a candidate for section 3.7 (phonemes whose doubled spellings are regular 
at the end of one-syllable words after a short vowel phoneme spelt with 
one letter). Yet /r/ spelt <rr> is not just rare in one-syllable words - it is 
non-existent - so it might seem not to fit into section 3.5 (phonemes whose 
doubled spellings are rare in one-syllable words) either. However, section 
3.5 is where I have put it, on the grounds that (a) there are some medial 
examples of /r/ spelt <rr>, e.g. error; (b) many other examples of /r/ spelt 
<rr> arise from suffixation, e.g. preferring, referral; (c) in these respects /r/ 
is similar to the other phonemes in section 3.5. 

Within each group | list the phonemes in alphabetical order of the 
letter(s) comprising their basic spellings, except that in section 3.5 /r/ is 
dealt with after /t/; /r/ is dealt with last because that leads on naturally 
to the treatment in section 3.6 of a special process involving /r/, namely 
/r/-linking, hence the interruption in the order of sections. 

Under each consonant phoneme | deal with the spellings in this order: 
1) The basic grapheme. In my opinion, each of the 24 consonant phonemes 

of English has a basic grapheme, the one that seems most natural as 

its spelling. The identification of <si> as the basic grapheme for /3/ 
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2) 


3) 


4) 
5) 
6) 


7) 


may seem curious - but this is the least frequent phoneme in English 
speech and <si> is its most frequent spelling. As you will see from 
the percentages at the beginning of each section, the basic grapheme 
is also, in 20 cases, the most frequent spelling of that phoneme - the 
exceptions are /z, dz, J, j/. 

Other graphemes which are used for the phoneme with reasonable 
frequency. By reasonable frequency | mean at least 5 per cent of the 
occurrences of the phoneme in running text. 

The doubled spelling, if the phoneme has one - 16 of the 24 consonant 
phonemes do (indeed, a few have more than one). Most doubled 
consonant spellings consist of the basic single-letter grapheme written 
twice, but some have a different pattern. Most of the doubled spellings 
are quite rare in stem words. For some guidance on when to spell a 
consonant double see chapter 4. None of the doubled spellings of 
English consonant phonemes ever occur in word-initial position (with 
the two exceptions noted under /I/ in sections 3.7.5 and 4.1), so word- 
initial position is not mentioned in the entries about doubled spellings 
in this chapter (except under /I/). 

The doubled spelling plus final <e>, if the phoneme has such a spelling. 
Oddities, graphemes which are used to spell that phoneme only rarely. 

Any 2-phoneme graphemes in which the phoneme is represented. Almost 
all the 2-phoneme graphemes are also Oddities, but a few belong to the 
main system (see section 3.4) and are included there. 

Any 3-phoneme grapheme in which the phoneme is represented. Both 

3-phoneme graphemes are definitely Oddities. 


Some entries end with Notes, and a few have Tables. 


3.3 Frequencies 


Under most phonemes | give the frequency of occurrence of each major 


grapheme as a spelling of the phoneme, using the information in Edward 


Carney’s massive study A Survey of English Spelling (1994). He gives two 


frequencies for most phoneme-grapheme correspondences: 


text frequency, that is, the frequency with which the correspondence 
occurs when you count all the correspondences in a large set of 
pieces of continuous prose, but discounting derived forms of stem 
words, e.g. past tenses, and all function words, e.g. of, is, there, 
where. Because Carney lemmatised his corpus (that is, reduced all the 
words to stem forms), his text frequencies for doubled consonants 
are probably systematically underestimated, since large numbers of 
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occurrences of doubled consonant letters arise from suffixation - see 

sections 4.2 and 4.3.1; 

lexical frequency, that is, the frequency with which the correspondence 

occurs when you count all and only the correspondences in a dictionary. 
Usually the two frequencies are similar, but where a particular correspondence 
occurs in only a few words but those words are very common, the text 
frequency will be high and the lexical frequency low (and vice versa where 
a correspondence occurs in many words but those words are rare). For this 
chapter and chapter 5 I’ve used only Carney’s text frequencies since those 
(mainly) represent what readers encounter. However, my lists of examples 
range far and wide within English vocabulary, and take in words which are 
So rare that they certainly did not contribute to Carney’s text frequencies. 
An odd category here is words in which /1a/ is spelt <ier> - this category 
is never mentioned by Carney; presumably no such words turned up in the 
corpus he compiled and analysed. 

| give no frequencies for doubled spellings plus final <e> since these 

are all rare, and in most cases the frequencies for the Oddities are lumped 
together. 


3.4 The main system and the rest 


Under each phoneme | separate the correspondences with graphemes into 
what | consider to be the main system and the rest (this distinction is very 
similar to that between major and minor units postulated by Venezky, 1970: 
52-55). The correspondences which | include in the main system are those 
which seem to me to operate as part of larger regularities, even though 
pretty rarely as absolute rules. For the consonant phonemes the larger 
regularities comprise the basic correspondences, the correspondences 
which have reasonable frequency as I’ve defined it above, and the doubled 
spellings, but not the doubled spellings plus <e>, the 2-and 3-phoneme 
graphemes (except a few 2-phoneme graphemes which are of reasonably 
high frequency), or the Oddities. In this chapter (and in chapters 5, 9 and 
10) correspondences which have reasonable frequency are shown in 9-point 
type, the rest in smaller 7.5-point type. 

Three quite rare correspondences are, however, included in the main 
system - /k/ spelt <q>, /3/ spelt <ge>, and /u:/ spelt <ue>. For /k/ 
spelt <q> this is because <q> would otherwise not appear in the main 
system at all, but <q> is a grapheme of written English and therefore has 
to be included; also, the 2-phoneme sequence /kw/ is mainly spelt <qu>. 
/3/ spelt <ge> is needed to complete the pattern of correspondences in 
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word-final position - see Table 3.1. And although /u:/ spelt <ue> is very 
rare, | found it necessary to include it in the main system because the 
mirror-image correspondence (<ue> pronounced /u:/) is one of only two 
frequent correspondence of <ue> - see section 10.37. 


3.5 Consonants with doubled spellings which 
are rare in one-syllable words: 
/bdgmnpty, plus /r/ 


For the incluson of /r/ in this section see section 3.2. 

Despite their rarity in stem words, the doubled spellings of these 
consonant phonemes arise very frequently from suffixation, e.g. rubbed, 
budding, begged, skimmed, skinned, hopped, pitted, preferring (see sections 
4.2 and 4.3.1). 


3.5.1 /b/ as in by 


THE MAIN SYSTEM 


Basic grapheme <b> 98% e.g. rabid 
Other frequent graphemes (none) 


Doubled spelling <bb> <1% medially, regular before final 
/al/ spelt <-le> after a short 
vowel spelt with a single letter, 
e.g. babble - see section 4.3.3; 
there are also independent 
medial examples, e.g. abbey, 
abbot, bobbin, cabbage, dibber, 
hobbit, hobby, hubbub, rabbi, 
rabbit, ribbon, rubber, rubbish, 
Sabbath, shibboleth, stubborn - 
see sections 4.3.4 and 4.4.5-6; 
word-finally, only in ebb - see 
section 4.3.2 
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THE REST 


Doubled spelling + <e> 


Oddities 
<bh> 
<bu> 
<pb> 
2-phoneme graphemes (none) 


NOTES 


(does not occur) 
1% in total 


only in abhor and its derivatives abhorred, 
abhorrent, plus bhaji, bhang(ra), bhindi, 
Bhutan and a few other rare words from 
the Indian sub-continent. <b, h> are 
usually separate graphemes at a morpheme 
boundary, as in clubhouse, subheading 


only in build, buoy, buy. See Notes 


only in the compound words cupboard, 
raspberry, plus Campbell 


For the compound words gooseberry /'guzbri:/), raspberry /‘ra:zbri:/), 


strawberry /'stra:bri:/) see section 6.10. 


| analyse <bu> in build, buoy, buy as a grapheme spelling /b/ because 


this is more economical than adding /1/ spelt <ui>, /31/ spelt <uoy> and 


/at/ spelt <uy> to the list of graphemes; cf. <gu> under /g/, section 3.5.3, 


and <cu> under /k/, section 3.7.1. 
3.5.2 /d/ as in dye 


THE MAIN SYSTEM 


Basic grapheme <d> 


Other frequent grapheme <ed> 


Doubled spelling <dd> 2% 


98% e.g. bud 


(not counted in percentages) 
See Note 


medially, regular before final 

/al/ spelt<-le> after a short 

vowel spelt with one letter, e.g. 
griddle - see section 4.3.3; other 
medial examples include addictive, 
additive, adduce, bladder, 
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buddy, cheddar, fodder, judder, 
ladder, midden, rudder, ruddy, 
shoddy, sodden, sudden, teddy, 
toddy, widdershins - see sections 
4.3.4 and 4.4.5-6; perhaps also 
the compound word granddad, 
but see section 4.4.7; word-finally, 
only in add, odd, rudd, Sudd - on 
add, odd see section 4.3.2 


THE REST 


Doubled spelling + <e> (does not occur) 
Oddities <1% in total 
<bd> only in bdellium 
<ddh> only in Buddha and derivatives, saddhu 


<de> only in aide, blende, blonde, horde and 
(for) bade pronounced /(fa')bed/ (also 
pronounced /(fa')betd/ with <d> alone 
spelling /d/ and <a.e> spelling /e1/). The 
<e> in blonde marks it French-style as 
feminine (masculine: blond) 


<dh> only in a few loanwords and names from 
the Indian subcontinent, e.g. dhobi, dhoti, 
dhow, Gandhi, jodhpurs, sandhi, Sindh 


2-phoneme graphemes (none) 


NOTE 


/d/ is almost always spelt <ed> in past forms of regular verbs ending in a 
voiced consonant other than /d/ or a vowel, e.g. ebbed, flowed. The only 
exceptions are laid, paid which would (if they were spelt regularly) be “layed, 
“payed - cf. delayed, played and sections 5.7.1 and 6.5. See also the entry 
for <ed> in chapter 10, section 10.15. 
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3.5.3 /g/ as in goo 


THE MAIN SYSTEM 


Basic grapheme <g> 92% e.g. beg 
Other frequent graphemes (none) 


Doubled spelling <gg> 2% medially, regular before final 
/al/ spelt <-le> after a short 
vowel spelt with one letter, e.g. 
muggle - see section 4.3.3; 
other medial examples include 
aggressive, beggar, dagger, 
doggerel, haggis, jagged, 
maggot, nugget, ragged, rugged, 
rugger, sluggish, trigger - see 
sections 4.3.4 and 4.4.5-6; 
word-finally, only in egg - see 


section 4.3.2 
THE REST 
Doubled spelling + <e> (does not occur) 
Oddities 2% in total 
<ckgu> only in blackguard /'blegad, 'blega:d/ 
<gh> word-final only in ugh; otherwise only 


in afghan, aghast, burgher, ghastly, 
ghat, ghee, gherkin, ghetto, ghillie 
(also spelt gillie), ghost, ghoul, ogham, 
sorghum and a few more rare words 


<gu> word-initially, only in guarantee, 
guard, guerrilla, guess, guest, guide, 
guild, guilder, guile, guillemot, 
guillotine, guilt, guinea, guise, guitar, 
guy and a few more rare words; 
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2-phoneme graphemes 


<gue> 


/gz/ 


(1) spelt <x> 


medially, only in baguette, beguine, 
dengue, disguise, languor (the <u> 
surfaces as /w/ in languid, languish - see 
section 7.2) and suffixed forms of a few 
words in next category, e.g. cataloguing; 
phonemically word-final only in brogue, 
drogue, fatigue, fugue, intrigue, plague, 
rogue, vague, vogue and a few more 

rare words where the final written <e> 

is part of a split digraph with the vowel 
letter preceding the <g> - see also next 
paragraph, and Notes 


only word-final and only in analogue, 
catalogue, colleague, decalogue, 
demagogue, dialogue, eclogue, 
epilogue, ideologue, league, monologue, 
morgue, pedagogue, prologue, 
prorogue, synagogue. |In some of the 
words ending <-ogue> US spelling 
has <-og>, which is simpler in the 
stem forms but means that in, e.g., 
cataloging the first <g> (less regularly) 
spells /g/ before <i>, a problem which 
the spelling with <u> avoids. The only 
word in which final <g, u, e> are all 
separate graphemes is segue /'segwe1/ 


For all of these see Notes 
4% 


only in some polysyllabic words 

of Latin origin, namely anxiety 
pronounced /zxn'gzarjiti:/ (also 
pronounced /zn'zarjiti:/), auxiliary, 
exact, exaggerate, exalt, exam(ine), 
example, exasperate, executive, 
executor, exemplar, exemplify, exempt, 
exert, exigency, exiguous, exile, exist, 
exonerate, exorbitant, exordium, 
exuberant, exude, exult, plus exotic from 
Greek and a few more rare words; also 
in Alexandra, Alexander and becoming 
frequent in exit pronounced /'‘egzit/ 
(also pronounced /'‘eksit/). For anxiety 
see also under /n/ in section 3.8.2 
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(2) spelt <xh> only in about 7 polysyllabic words 
of Latin origin: exhaust(ion), exhibit, 
exhilarat-e/ ion, exhort, exhume - but 
in some derivatives <xh> spells 
/eks/, e.g. exhibition, exhortation, 
exhumation 


/g3/ spelt <x> only in luxuriance, luxuriant, luxuriate, 
luxurious 


NOTES 


In blackguard (also spelt blaggard), guarantee, guard the <u> is technically 
redundant because <ckg, g> would spell (and be pronounced) /g/ without 
it. But in all the other words with <gu> the <u> has to be there in order to 
prevent the <g> appearing to spell (and be pronounced) /d3/. It’s because 
guild, guy must be analysed this way that | analyse build, buy (and by 
extension buoy) as having /b/ spelt <bu> (see section 3.5.1, and cf. <cu> 
under /k/, section 3.7.1). 

The regular 2-grapheme spelling of /gz/ is <gs>, e.g. dogs. The 
sequence <gz> seems to occur only in zigzag. 

The 2-phoneme sequence /g3/ seems to occur only in the four words 
listed above and to have no 2-grapheme spelling. 

The 2-phoneme sequence /gw/ is almost always spelt <gu>, e.g. in 
anguish, distinguish, extinguish, guacamole, guano, guava, iguana, 
language, languish, linguist, penguin, sanguine, segue, unguent. Exception: 
wigwam. The converse does not hold - most occurrences of <gu> are 
pronounced either as /g/ or as 2 phonemes (/g/ plus a vocalic pronunciation 
of <u>) - see section 9.15. 

For <go> in allegory, category see section 6.10. 


3.5.4 /m/ as in my 


THE MAIN SYSTEM 


Basic grapheme <m> 96% e.g. sum 


Other frequent graphemes (none) 
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Doubled spelling <mm> 3% 
THE REST 
Doubled spelling + <e> <mme> 


Oddities (all word-final only) 


<gm> 


<mb> 


<mbe> 


medially, does NOT occur 
before final /al/ spelt 

<-le>; medial examples 
include comma, commune, 
cummerbund, hammock, 
hummock, immense, plummet, 
rummage, slummock, summit 
and some derived forms, e.g. 
dia/pro-grammatic, immodest - 
see sections 4.3.4 and 4.4.5-6; 
never word-final 


now only in oriflamme and (non- 
computer) programme since gram 
and its derivatives are no longer spelt 


“gramme, etc. 


<1% in total 


only in apophthegm, diaphragm, 
epiphragm, paradigm, phlegm, 
syntagm. /g/ surfaces in some 
derivatives: paradigmatic, phlegmatic, 
syntagma(tic) - see section 7.2 


only in dithyramb, lamb; climb, 
limb; aplomb, bomb, catacomb, 
comb, coomb, coxcomb, coulomb, 
hecatomb, rhomb, tomb, womb, 
crumb, dumb, numb, plumb, rhumb, 
succumb, thumb and a few more 
very rare words. /b/ surfaces in 
some derivatives: dithyrambic, 
bombard ier), bombast(iod, 
rhomb-ic/us, crumble and supposedly, 
according to some authorities, in 
thimble - see section 7.2 


only in buncombe (‘nonsense’; also 
spelt bunkum), co(o)mbe (‘short 
valley’; also spelt coomb); and 
contrast flambe /'flomber/, where 
<m, b, e> are all separate graphemes 
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<me> 


<mn> 


<nd> 


2-phoneme grapheme /am/ 
spelt <m> 


NOTE 


never initial; mainly word-final and 
there only in become, come, some, 
welcome and the adjectival suffix 
/sam/ spelt <-some>, e.g. handsome 
(contrast hansom); medially only 

in camera, emerald, omelette, 
ramekin pronounced /'reemkin/ (also 
pronounced /'reemikin/) - see section 
6.10 - and Thames 


only in autumn, column, condemn, 
contemn, damn, hymn, limn, solemn. 
/n/ surfaces in some derivatives: 
autumnal, columnar, columnist, 
condemnation, contemner, damnable, 
damnation, hymnal, solemnity - see 
section 7.2 


only in sandwich pronounced 
/'‘semwidz/ (also has a ‘regular’ 
spelling pronunciation /'sendwitf/) 


only word-final, e.g. chasm, 
enthusiasm, orgasm, phantasm, 
pleonasm, sarcasm, spasm, several 
words ending in -plasm (e.g. 
ectoplasm), chrism, prism, schism and 
all the many derived forms ending 

in -ism, macrocosm, microcosm, 
abysm, aneurysm (also spelt aneurism), 
cataclysm, paroxysm, algorithm, 
rhythm and a few other very rare 
words; also in film pronounced 
/'ftlam/ in some Irish accents. See Note 


In all but the last three of the words just listed with word-final /am/ spelt <-m> 


the preceding phoneme is /z/ spelt <s>, so the regular spelling of word-final 


/zam/ is <-sm> (only exception: bosom). This is one of only a handful of 


cases where the spelling of a final syllable is more predictable as a whole than 


from its separate phonemes, which here would predict (for example) “chasam, 


“prisom, etc. However, word-final /am/ with other preceding phonemes has 


various 2-grapheme spellings in, e.g., alyssum, balsam, besom, fathom (but 


contrast the 1-grapheme spelling in algorithm, rhythm), gypsum, hansom, 


lissom, opossum, ransom, transom and all the adjectives ending <-some>. 
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3.5.5 /n/ as in nigh 


THE MAIN SYSTEM 


Basic grapheme <n> 97% e.g. tin 


Other frequent (none) 
graphemes 


Doubled spelling <nn> <1% medially, does NOT occur before final 
/al/ spelt <-le>; medial examples 
include anneal, annual, annul, 
biennale, binnacle, Britannic, cannibal, 
chardonnay, cinnabar, cinnamon, ennui, 
innocent, punnet, tannic, tinnitus, 
tintinnabulation, zinnia - see sections 
4.3.4 and 4.4.5-6; word-finally, only in 
Ann, djinn, Finn, inn - on Ann, inn see 
section 4.3.2 


THE REST 


Doubled spelling + <e> <nne> only word-final and only in Anne, cayenne, 
comedienne, cretonne, doyenne, tonne and a few 
other rare words 


Oddities 3% in total 
<dne> only in Wednesday 


<gn> word-initially, only in gnarl, gnash, gnat, gnaw, 
gneiss, gnome, gnosis, Gnostic and gnu analysed 
as /n/ spelt <gn> plus /ju:/ spelt <u>; medially, 
only in cognisance (also pronounced with /gn/), 
physiognomy, recognise pronounced /'rekanaiz/ 
(usually pronounced /'rekagnatz/); word-finally, 
only in align, arraign, assign, benign, campaign, 
coign, condign, consign, deign, design, ensign, feign, 
foreign, impugn and a few other very rare words 
in -pugn, malign, reign, resign, sign, sovereign, 
thegn; also phonemically word-final in champagne, 
cologne where the final written <e> is part of a 
split digraph with the letter before the <g> spelling 
a diphthong. /g/ surfaces in some derivatives: 
agnostic, diagnosis, prognosis, malignant, 
pugnacious, repugnant, assignation, designation, 
resignation, signal, signature - see section 7.2 
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<gne> only word-final and only in cockaigne, epergne, 
frankalmoigne /ka'kein, 1'p3in, ‘freqkzelmoin/ 


<kn> 1% never word-final; medially, only in acknowledge, 
knick-knack; otherwise only word-initial and only 
in knack(er(s)), knap, knave, knead, knee, knell, 
knew, knick(er(s)), knickerbocker, knick-knack, 
knife, knight, knit, knob, knobbly, knock, knoll, knot, 
know ledge), knuckle and a few more very rare words 


<mn> only word-initial and only in mnemonic, mnemonist. 
/m/ surfaces in amnesia, amnesty - see section 7.2 


<nd> only in grandfather, Grandma (hence the frequent 
misspelling “Granma - cf. section 4.4.7 on 
Gran(d) dad), handsome (cf. hansom (cab)), 
landscape 


<ne> non-finally, only in vineyard (and even there it’s 
stem-final), vulnerable pronounced /'valnrabal/ 

- see also Notes and section 6.10 (I refuse to 
analyse the alternative pronunciation /'vanrabal/ 
with loss of the first /I/ because it would add 
an otherwise not-needed grapheme <Ine> to 
the inventory); otherwise only word-final and 
only in about 35 words, namely borne, bourne, 
bowline, Catherine, clandestine pronounced 
/klan'desttn/ (also pronounced /'klandastatn/), 
cocaine, compline, crinoline, demesne, (pre)destine, 
determine, discipline, engine, ermine, examine, 
famine, feminine, genuine, gone, groyne, heroine, 
hurricane pronounced /‘haritkan/ (also pronounced 
/‘hartketn/), i/lumine, intestine, jasmine, marline, 
masculine, medicine, migraine, moraine, none, 
peregrine, ptomaine, saccharine, sanguine, scone 
pronounced /skon/ (also pronounced /skaun/), 
shone, urine, vaseline, wolverine. |n all these words 
the <e> is phonographically redundant, in that 
its removal would not affect the pronunciation. 
However, without <e> done, none would 
become don and the prefix non- (and changing 
their spellings to dun, nun would cause other 
confusions). Also, the <e> keeps borne, heroine 
visually distinct from born, heroin 


<ng> only in length, lengthen, strength, strengthen 
pronounced /len®, '‘len@an, stren®O, 'strenOan/. See 
also under /k, n/, sections 3.7.1, 3.8.2 


<nt> only in croissant, denouement, rapprochement 
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<nw> only in gunwale 


<pn> only word-initial and only in words derived from 
Greek trvedux pneuma (‘breath’) or Tvebwwv 
pneumon (‘lung’), e.g. pneumatic, pneumonia 


2-phoneme graphemes /an/ only in Haydn (| mention him in memory of Chris 
spelt <n> Upward of the Simplified Spelling Society) and most 

contractions of not with auxiliary verbs, i.e. isn’t, 
wasn’t, haven’t, hasn’t, hadn’t, doesn’t, didn’t, 
mayn’t, mightn’t, mustn’t, couldn’t, shouldn't, 
wouldn’t, oughtn’t, usedn’t, some of which are 
rare to the point of disuse, plus durstn’t, which 
is regional/comic; in all of these except mayn’t 
the preceding phoneme is a consonant. Other 
contractions of not with auxiliary verbs (ain’t, 
aren’t, can’t, daren’t, don’t, shan’t, weren’t, won’t), 
i.e. all those with a preceding vowel phoneme 
(except mayn’t) are monosyllabic (though some 
Scots say /'dearant/ with a preceding consonant 
and linking /r/ and therefore two syllables). 
Curiously, innit, being a contraction of isn’t it, 
reduces isn’t to a single syllable. See Notes 


/nj/ see under /j/, section 3.8.8 
spelt <gn> 


NOTES 


/an/ has several 2-grapheme spellings, e.g. in cotton, ruffian, written. 
For <ne> in confectionery, generative, stationery, vulnerable see section 
6.10. 


3.5.6 /p/ as in pie 


THE MAIN SYSTEM 


Basic grapheme <p> 95% e.g. apt 
Other frequent (none) 


graphemes 


Doubled spelling <pp> 5% medially, regular before final /al/ spelt 
<-le> after a short vowel spelt with one 
letter, e.g. apple: other medial examples 
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THE REST 


Doubled spelling+<e> <ppe> 
Oddities 


<b> 


<bp> 
<gh> 


<pe> 


<ph> 


2-phoneme graphemes (none) 


3.5.7 /t/ as in tie 


THE MAIN SYSTEM 


Basic grapheme <t> 96% 


Other frequent <ed> 
grapheme 


Doubled spelling <tt> 3% 


include apply, apprehend, cappuccino, 
dapper, frippery, hippodrome, 
hippopotamus, guppy, opponent, oppose, 
opposite, scupper, supper, supply, support 
- see sections 4.3.4 and 4.4.5-6; word- 
finally, only in Lapp 


only in grippe, steppe 
<1% in total 


only in presbyterian pronounced 
/prespr'trarisjan/ (also pronounced 
/prezbr'trari:jan/) 


only in subpoena /sa'pi:na/ 
only in misspelling of hiccup as *hiccough 


only in canteloupe, troupe, plus opera in rapid 
speech - for <pe> in opera see section 6.10 


only in diphtheria, diphthong, naphtha, 
ophthalmic and shepherd. The first four also have 
pronunciations with /f/ - e.g. /'dif8on/ versus 
/‘dip80n/ 


e.g. rat 


(not counted in percentages) See Notes 


medially, regular before final /al/ spelt 
<-le> after a short vowel spelt with one 
letter, e.g. rattle; other medial examples 
include attention, attract, attribute, 
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THE REST 


Doubled spelling + <e> 


Oddities 


<tte> 


<bt> 


<ct> 


<dt> 


<phth> 


<pt> 


<te> 


battalion, battery, butter, button, 
buttress, chitterlings, falsetto, glutton, 
jitter(s), mattress, rattan, smattering, 
tattoo, tittup - see sections 4.3.4 and 
4.4.5-6; word-finally, only in bott, 
boycott, butt, matt, mitt, mutt, nett, putt, 
watt. See also Notes 


only word-final and only in about 23 stem words, 
namely baguette, brunette, cassette, coquette, 
corvette, croquette, epaulette, etiquette, garrotte, 
gavotte, gazette, maisonette, omelette, oubliette, 
palette, pipette, pirouette, roulette, serviette, 
silhouette, toilette, vignette, vinaigrette, anda 
few derived forms, e.g. cigarette, launderette, 
rosette, statuette, suffragette, and some other 
rare words. In latte <tt, e> represent separate 
phonemes, as do <u.e, tt> in butte 


1% in total 


only in debt, doubt, subtle. /b/ surfaces in debit, 
indubitable, subtility - see section 7.2 


only in Connecticut, indict, victualler, victuals. 
/k/ surfaces in indiction - see section 7.2 


only in veldt 


only in phthisic, phthisis pronounced 
/‘tarstk, 'tarsis/ 


only in Deptford, ptarmigan, pterodactyl (Greek, 
= ‘wing finger’), pterosaur (Greek, = ‘wing lizard’), 
Ptolem-y/aic, ptomaine, receipt and a few more 
very rare words. /p/ surfaces in archaeopteryx, 
helicopter (Greek, = ‘ancient wing, spiral wing’), 
reception, receptive - see section 7.2 


mainly word-final and in that position in at least 
120 words, namely 

- ate pronounced /et/ (also pronounced /eit/, 
which requires a different analysis: /t/ spelt <t> 
and /e1/ spelt <a.e>), Bacchante, composite, 
compote, confidante, debutante, definite, detente, 
dirigiste, enceinte, entente, entracte, exquisite, 
favourite, granite, hypocrite, infinite, minute 
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(‘sixtieth of an hour’), opposite, perquisite, 
plebiscite, pointe, requisite, riposte, route, svelte 
- about 30 nouns/adjectives in /at/ spelt <-ate> 
where the verbs with the same spelling are 
pronounced with /ert/, e.g. advocate, affiliate, 
aggregate, alternate (here with also a difference 
in stress and vowel pattern: noun/ adjective 
pronounced /):I't3:nat/, verb pronounced 
/‘d:ltanert/), animate, appropriate, approximate, 
articulate, associate, certificate, consummate 
(here with also a difference in stress and vowel 
pattern: adjective pronounced /kan'samat/, 
verb pronounced /'konsjamert/), coordinate, 
curate (here with also a difference in meaning 
and stress: noun (‘junior cleric’) pronounced 
/‘kjuarat/, verb (‘mount an exhibition’) 
pronounced /kjua'reit/), degenerate, delegate, 
deliberate (here with also a difference in syllable 
structure: adjective /dr'lrbrat/ with three 
syllables and an elided vowel - see section 6.10; 
verb /dr'lzbarert/ with four syllables), designate, 
desolate, duplicate, elaborate, estimate, 
expatriate, graduate, initiate, intimate, legitimate, 
moderate, pontificate (here with unrelated (?) 
meanings: noun (‘pope’s reign’) pronounced 
/pon'tiftket/, verb (‘speak pompously’) 
pronounced /pon'tiftkeit/), precipitate (but here 
only the adjective has /at/; the noun as well as 
the verb has /eit/), predicate, separate (here too 
with a difference in syllable structure: adjective 
/‘seprat/ with two syllables and an elided vowel 
- see section 6.10; verb /'separert/ with three 
syllables), subordinate, syndicate, triplicate. 

In the verbs and the many other nouns and 
adjectives with this ending pronounced /eit/, the 
<e> is part of the split digraph <a.e> spelling 
/e1/ and the /t/ is spelt solely by the <t> 

- a further set of at least 60 nouns/adjectives 
(some of which are derived forms) in /at/ spelt 
<-ate> with no identically-spelt verb, e.g. 
accurate, adequate, agate, appellate, celibate, 
chocolate, climate, collegiate, conglomerate, 
(in)considerate, consulate, delicate, desperate, 
(in)determinate, directorate, disconsolate, 
doctorate, electorate, episcopate, extortionate, 
fortunate, illegitimate, immaculate, immediate, 
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2-phoneme graphemes 


<th> 


<tw> 


/t8/ 

spelt <th> 
/ts/ 

(1) 


spelt <z> 


(2) 


spelt <zz> 


inanimate, in(sub)ordinate, inspectorate, 
intricate, inviolate, (bacca)laureate, legate, 

(i) literate, novitiate, obdurate, palate, particulate, 
(com/ dis-)passionate, private, profligate, 
proletariate (also spelt proletariat), 

(dis) proportionate, protectorate, proximate, 
roseate, senate, surrogate, (in)temperate, 
triumvirate, ultimate, (in)vertebrate (a few of 
these words do have related but not identically- 
spelt verb forms with <-ate> pronounced /elt/: 
animate, legitimate, mediate, subordinate, violate) 
- possibly just one word where both noun and 
verb have <-ate> pronounced /at/: pirate 

- <te> spelling /t/ also occurs medially in a few 
words in rapid speech, e.g. interest, literacy, 
literal, literary, literature, sweetener, veterinary - 
see section 6.10 

In all cases where /at/ is spelt <-ate> the <e> is 
phonographically redundant (that is, it does not 
indicate a ‘long’ pronunciation of the preceding 
vowel letter and could therefore be omitted from 
the spelling without altering the pronunciation; 
hence | have not analysed such words as having 
/3/ spelt <a.e> and /t/ spelt <t>), but in two 
cases it makes the words visually distinct from 
words without the <e> and with an unrelated 
meaning: point, rout. 

Carney does not recognise <te> as a spelling of 
/t/ and this probably means that percentages for 
my analysis would be slightly different from his 


only in Thai, thali, Thame, Thames, Therese, 
Thomas, thyme, Wrotham /'ru:tam/ 


only in two and derivatives, e.g. twopence, 
twopenny. /w/ surfaces in between, betwixt, twain, 
twelfth, twelve, twenty, twice, twilight, twilit, twin 
- see section 7.2 


only in eighth. See section 4.4.7 


only in Alzheimer’s, bilharzia, Nazi (but Churchill 
said /'na:zi:/), scherzo, schizo(-) 


only in intermezzo, paparazzi, pizza, pizzicato 
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NOTES 


/t/ is always spelt <ed> in past forms of regular verbs ending in a voiceless 
consonant other than /t/, e.g. walked. See also the entry for <ed>, section 
10.15. 

/ts/ also has 2-grapheme spellings, the regular one being <ts>, plus the 
Oddity <tz> - for the latter see under /s/, section 3.7.6. 


3.5.8 /r/ as in rye 


Occurs only before a vowel phoneme (in RP). 


THE MAIN SYSTEM 


Basic grapheme <r> 94% e.g. very 


Other frequent (none) 
graphemes 


Doubled spelling <rr> 4% medially, does NOT occur before final 
/al/ spelt <-le> and arises mainly 
from suffixation (see Notes), but there 
are some independent examples, e.g. 
arroyo, barrow, berry, borrow, burrow, 
carrot, derrick, garrotte, guerrilla, 
herring, horrid, hurry, lorry, mirror, 
(to)morrow, parrot, porridge, scurrilous, 
serrate, sorry, squirrel, stirrup, terrine, 
warrant, wherry, worrit, worry - see 
sections 4.3.4 and 4.4.5-6; never word- 
final as a separate grapheme - see 


Notes 

THE REST 

Doubled spelling + <e> (does not occur as a Spelling of /r/ - but see 
Notes) 

Oddities 2% in total 


<re> only in forehead pronounced /'forid/ 
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<rh> only word-initial and only in a few words 
mainly of Greek origin, namely rhapsody and 
several other words beginning rhapsod-, rhea, 
rheme, rhesus, rhetor(id, rheum(ati-c/sm), 
rhinestone, rhinoceros and several other words 
beginning rhin(o)-, rhizome and several other 
words beginning rhizo-, rhododendron and 
several other words beginning rhodo-, rhodium, 
rhomb-ic/us, the Greek letter name rho, rhotic, 
rhubarb, rhyme, rhythm and a few other rare 
words 


<rrh> only medial and only in a few words of Greek 
origin, namely amenorrhoea, arrhythmia, 
cirrhosis, diarrhoea, gonorrhoea, haemorrhage, 
haemorrhoid, lactorrhoea, logorrhoea, pyorrhoea, 
pyrrhic. N.B. In catarrh, myrrh the <rrh> is nota 
separate grapheme - see Notes (but in catarrhal 
/r/-linking occurs - see section 3.6) 


<wr> except in awry, only in initial position and only 
in wrap, wrasse, wreck, wren, wrench, wrest(le), 
wretch(ed), wriggle, wring, wrinkle, wrist, write, 
wrong, Wrotham /'ru:tam/, wrought, wry and a 
few more rare words 


2-phoneme graphemes (none) 


NOTES 


The only stem words in which final <-rr, -rre, -rrh> occur are carr, charr, 
parr, err, chirr, shirr, skirr, whirr, burr, purr, barre, bizarre, parterre, catarrh, 
myrrh. Because there is no /r/ phoneme in these words (in RP), these letters 
do not form separate graphemes but are part of the trigraphs or four-letter 
graphemes <arr, err, irr, urr, arre, erre, arrh, yrrh> spelling variously /a1, 3:, ea/ 
- see the entries for those phonemes in sections 5.5.1, 5.5.2 and 5.6.3 and, for 
some suffixed forms, the next section. For err see also section 4.3.2. 

In words like preferring, referral, the <rr> is due purely toa spelling rule 
involving the suffix - see the next section and section 4.2. In such words 
the letters <err> spell the vowel /3:/ and the <rr> also spells the linking 
/r/ consonant - for /r/-linking see section 3.6, and for dual-functioning 
section 7.1. But in berry, errant, guerrilla, herring, wherry, abhorrent, 
demurral, garrotte, <e, 0, u, a> spell /e, v, A, a/ and the <rr> simply spells 
/r/ without influencing the pronunciation of the vowel; similarly in the other 
words listed above as having independently-occurring medial <rr>. 
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3.6 /r/-linking 


Although word-final /r/ does not occur in RP when words are pronounced 
in isolation, words which end in letter <r> after a vowel letter retain the 
possibility of a /r/ phoneme surfacing when a suffix or the next word begins 
with a vowel phoneme. For example, | pronounce the phrase dearer and 
dearer with three /r/ sounds, corresponding to the first three occurrences 
of the letter <r>: /'diararan'diara/. For more phonological detail see 
Cruttenden (2014: 224, 315-7). 

Many people call this phenomenon ‘<r>-linking’, using the name of the 
letter <r>.1 prefer to call it ‘/r/-linking’, using (in speech) the sound of, or 
(in writing) the symbol for, the phoneme /r/ because that is what the link 
consists of in speech. Moreover, various other graphemes which can spell 
/r/ allow /r/-linking - see, for example, <rrh> in catarrhal in the entry 
for /r/ just above. /r/-linking is one of four special processes which | have 
identified as operating in English spelling (for the others see section 6.10 
and chapter 7). 

In Table 3.2 | have assembled all the examples of /r/-linking mentioned 
in this book. 


NOTES TO TABLE 3.2: FULL LIST OF /r/-LINKING CATEGORIES. 


In some cases the pre-linking ‘phoneme’ is actually a 2-phoneme sequence. 

In a few categories where, before linking, the last phoneme of the stem 
is /a/ spelt <er, or>, /a/ is deleted in speech and <e, o> in writing, and the 
<r> is left to spell /r/. This process needs to be distinguished from vowel 
elision (see section 6.10), where a vowel letter is written even though there 
is no vowel phoneme at that point in the spoken word. 

Where <e>-deletion (see section 6.4) occurs, | analyse the phoneme 
before the linking /r/ (provided it has not been deleted or elided) as spelt by 
the pre-linking grapheme minus <e>, even when that phoneme has changed. 

Except where stated: 

1) stress placement and the phoneme before the linking /r/ remain 
unchanged; 

2) the /r/-linking grapheme continues to function as part of the spelling of 
the preceding phoneme (dual-functioning - see section 7.1), even when 
that phoneme has changed and/or <e>-deletion has occurred. This 
principle is adopted in order to avoid introducing some correspondences 
for which there is no other warrant in my analysis, e.g. <a> alone 
spelling /ea/ in vicarious. For more detail see section A.8 in Appendix A. 
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TABLE 3.2: FULL LIST OF /r/-LINKING CATEGORIES. 


Phoneme Grapheme | /r/-linking | Example(s) Notes 
before spelling grapheme 
/r/-linking | that 
phoneme 
polarise 
Stress shifts to last 
familiarity, hilarity, syllable of stem, the 
peculiarity, polarity, | vowel there shifts to /x#/ 
vulgarity and is spelt only by <a>, 
<ar> 
and <r> spells only /r/ 
Stress shifts to last 
£795 syllable of stem, and 
vicarious ; 
the vowel there shifts 
to /ea/ 
Stress shifts to last 
: syllable of stem, and 
ethereal, managerial : 
the vowel there shifts 
<er> to /1a/ 
hyperintelligent, 
one 9 /a/ may be elided - see 
interagency, leverage, i 
: section 6.10 
offering, sufferance 
/a/ <eur> <I> amateurish 
foundress, hindrance, | /a/ is deleted, as shown 
laundress, ogress, by the disappearance of 
<er> temptress, tigress, the penultimate <e> of 
waitress, wardress, the stem, and <r> spells 
wintry only /r/ 
/a/ is deleted, as shown 
actress, , 
by the disappearance of 
ambassadress, ‘ 
the penultimate <o> of 
conductress, 
: . _ | the stem, and <r> spells 
dominatrix, executrix 
only /r/ 
for instance, prioress, 
<or> 


terrorist 
Stress shifts to last 
syllable of stem, the 
authority vowel there shifts to /p/ 


and is spelt only by <o>, 
and <r> spells only /r/ 
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/2/ 


Stress shifts to last 


quthorial, 
<or> j : syllable of stem, and the 
dictatorial : 
<r> vowel there shifts to /3:/ 
. /3/ may be elided - see 
favourite ; 
section 6.10 
<r>, plus 
<our> deletion of . 
glamorise, 
<u> from ; : 
; rigorous, vigorous 
final syllable 
of stem 
; /a/ is deleted (as shown 
central, fibrous, : 
by disappearance of 
lustr-al/ous, 
; <e>), and <r> spells 
metrical, spectral 
only /r/ 
/a/ is deleted (as shown 
by disappearance of 
mediocrity, <e>), stress shifts to 
sepulchral, syllable before suffix if 
theatrical not already there, vowel 
there shifts to /p, A, 2&/, 
and <r> spells only /r/ 
<re> /a/ is deleted (as shown 
by disappearance of 
calibration <e>), stress shifts to first 
syllable of suffix, and 
<r> following <r> spells only /r/ 
<e>-deletion /a/ is not deleted (as 
acreage, : 
; shown by retention of 
massacreing, 
; <e>), and <r> spells only 
ochreous, ogreish 
/r/, but the schwa and /r/ 
/‘etkar1dg, 
‘ seem to be spelt by the 
mesokearin, ; 
<e> and <r> in reverse 
‘'aukearas, ‘augarif/ 
order 
Stress shifts to 2nd syll- 
injurious able of stem, and vowel 
there shifts to /jua/ 
/3/ is spelt only by the 
<ure> adventurous, <u> and may be elided, 


natural, naturist, 
procedural, 
treasury 


especially in derived 
adverbs - see section 
6.10 - and <r> spells 
only /r/ 
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TABLE 3.2: FULL LIST OF /r/-LINKING CATEGORIES, CONT. 


Phoneme Grapheme /r/-linking Example(s) Notes 
before spelling that | grapheme 
/r/-linking | phoneme 
murmuring 
/a/ Sie Stress shifts to last syllable 
sulphuric of stem, and the vowel 
there shifts to /jua/ 
Sir Stress shifts to first syllable, 
conference, and last vowel phoneme of 
<er> deference, stem shifts to /a/ (or may 
preference be elided - see section 
6.10) 
Preceding vowel shifts to 
<err> errant 
/e/, and <rr> spells only /r/ 
. : <rr> ate 
/3x/ <irr> whirring 
<urr> purring 
<rr> arising | conferring, deferring, 
<er> 
from preferring, referral 
consonant furry, occurring 
aps letter 
ur : Preceding vowel shifts to 
doubling demurral 4 a ecaal 
(see section /A/, and <rr> spells only /r/ 
<ar> 4.2) sparring 
<rr> 
<arre> following bizarrery 
<e>-deletion 
/ax/ 
<arrh> <rrh> catarrhal 
Stress shifts to suffix, and 
<ar> cigarette, czarina in cigarette vowel phoneme 
preceding /r/ shifts to /a/ 
/wa:/ <oir> memoirist 
<heir> inherit Too complicated to analyse 
; <r> ae 
<air> repairing 
tes <aire, heir> millionairess, heiress | Stress shifts to final syllable 
ea 
In mayoress, stress shifts to 
<ayor> mayoral, mayoress : 
final syllable 
<ear> wearing 
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<re> thereupon 
<ere> 
wherever, compering 
jea/ <r> 
following staring 
<are> : 
<e>-deletion preparedness 
. 2nd <e> surfaces as /1/ - 
entirety ; 
see section 7.2 
wiring 
Stress shifts to first syllable 
of suffix, vowel in last 
33 OP. 23 syllable of stem shifts to 
inspiration di | 
Zites /1/ or /a/ and is spelt only 
by the <i>, and <r> spells 
only /r/ 
jata/ <r> 
following Stress shifts to last syllable 
<e>-deletion of stem, the vowel there 
satirical shifts to /1/ and is spelt 
only by the <i>, and <r> 
spells only /r/ 
Vowel in stem shifts to /1/ 
lyrical and is spelt only by the <i>, 
SVIee and <r> spells only /r/ 
pyromaniac 
<ure> enduring, surety 
/09/ 
<oor, our> <r> boorish, touring 
<ar> <rr> arising warring 
from 
consonant Preceding vowel shifts to 
letter /v/ and is spelt only by 
<or> doubli abhorrent 
/o:/ oubling <o>, and <rr> spells only 
(see section /r/ 
4.2) 
<or, Oar, oor, mentoring, hoary, 
<r> 
our> flooring, pouring 
/o:/ <ore> boring 
<r> interfering 
Lares following Preceding vowel shifts to 
a <e>-deletion sincerity /e/ and is spelt only by <e>, 
1a 
and <r> spells only /r/ 
<ear, eer, dearer, hearing, 
ier> <r> cheering, tiering 
/avea/ <our, ower> devouring, towering 
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An even fuller analysis would also mention cases of /r/-linking occurring 
where an intervening consonant phoneme has been dropped, as in the 
place- and surname Wareham /'wearam/ (where the /h/ of the Anglo- 
Saxon placename element ham was dropped many centuries ago) and the 
British Tommy’s adage about medals: ‘Win ’em and wear ’em’ - here the 
end of the sentence is also pronounced /'wearam/, the /d/ phoneme of 
RP /'wea dam/ (with no /r/) having been elided. But this book is not about 
placenames, surnames or accents other than RP. 

Sometimes /r/-linking is overgeneralised to words which do not have 
a letter <r> in the written form (and never had, and still do not have, a 
/r/ phoneme in any accent of English when pronounced in isolation): the 
best-known example is law and order pronounced /'ld:ra'no:da/ (‘Laura 
Norder’) with ‘intrusive /r/’, rather than /'lo:wa'no:da/. (But this phrase 
never seems to be pronounced /'ldrra'ndd:da/ (‘Lauren Dawder’), with the 
<d> of and made explicit.) An example that occurs in children’s speech 
is drawing pronounced /‘drorrin/ rather than /‘droi1n/. Cruttenden (2014: 
316) provides several more examples. 

On the other hand, /r/-linking is sometimes avoided where the spelling 
suggests it would be natural. For example, the recorded announcers at 
Sheffield railway station say /‘platfoim fo: ‘er, 'mentfista ‘eapo:t, ‘meentf{ista 
‘oksfad 'raud, ‘fata auks/ rather than /'pletfo:m fo:'rer, ‘'maent{ista'reapo:t, 
‘meent{ista'roksfad 'raud, 'fararauks/ for ‘Platform 4A’, ‘Manchester Airport’, 
‘Manchester Oxford Road’, ‘Shireoaks’. 

Almost all instances of /r/-linking are also examples of what | call dual- 
functioning. That is, after linking, the <r>, etc., continues to function as 
part of the grapheme spelling the pre-linking phoneme while also spelling 
/r/ in its own right. Exceptions shown in Table 3.2 where an <r> ceases 
to function as part of the grapheme spelling the pre-suffixation phoneme 
and therefore only spells /r/ after suffixation are: familiarity, hilarity, 
peculiarity, polarity, vulgarity, foundress, laundress, ogress, temptress, 
tigress, waitress, wardress, actress, ambassadress, conductress, dominatrix, 
executrix, protrectress, authority, mediocrity, sepulchral, theatrical, central, 
fibrous, lustr-al/ous, metrical, spectral, calibration, demurral, inspiration, 
satirical, lyrical, abhorrent, sincerity. 

For other categories of dual-functioning see section 7.1. 

For cases in which /a/ may be elided after /r/-linking see also section. 
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For ‘linking /w/’ and ‘linking /j/’ see sections 3.8.7-8. Like /r/-linking, 
both occur frequently between a stem word and a suffix or a following word 
beginning with a vowel phoneme. However, there are two key differences: 
(1) in /w/- and /j/-linking, the quality of the glide between stem and suffix 
or next word is entirely predictable from the stem-final phoneme, whereas 
/r/-linking never is (in RP), and can be explained only historically - it occurs 
where once there was a postvocalic /r/; (2) similar /w/- and /j/-glides 
occur within many stem words where there is no indication of them in the 
spelling - /r/-linking never occurs within stem words. 


3.7 Consonants with doubled spellings which 
are regular at the end of one-syllable 
words after a short vowel spelt with one 
letter: /k tt f Blsvz/ 


In addition to their frequency in stem words, the doubled spellings of 
/k f 1 s/ occasionally arise from suffixation, e.g. picnicking, iffy, modelling, 
gassing (see sections 4.2 and 4.3.1). 


3.7.1 /k/ as in coo 


THE MAIN SYSTEM 


For all these categories see Notes and Table 3.3. 


Basic grapheme <c> 59% e.g. cat 
Regular in all positions except (1) before 
<e, i, y>, where the regular spelling is <k> 
(2) before final /al/ spelt <-le> aftera 
short vowel spelt with one letter, where the 
regular spelling is <ck> (3) word-finally 
in one-syllable words, where the regular 
spelling is <ck> after a short vowel spelt 
with one letter, otherwise <k> For other 
exceptions see below 
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Other frequent <k> 21% 
grapheme 


Doubled spelling <ck> 6% 


Frequent /ks/ 5% 
2-phoneme spelt 
grapheme <x> 


Rare grapheme <q> 3% 


THE REST 

Doubled spelling 

+ <e> 

Oddities 
<cc> 
spells /k/ 


regular before <e, i, y>, e.g. kelp, kit, sky, 
including word-finally within split digraphs, 
e.g. like, make; also word-finally in one- 
syllable words except those where <ck> is 
regular. Only exceptions: ache, Celt, Celtic, 
sceptic and one pronunciation of words 
beginning encephal-, arc, chic, disc, anda 
few more words 


regular in word-final position in one- 
syllable words after a short vowel spelt 
with one letter, e.g. crack; also before final 
/al/ spelt <-le> after a short vowel spelt 
with one letter, e.g. heckle - see section 
4.3.3; for other occurrences medially in 
stem words see Table 3.3; there are several 
word-final occurrences in polysyllables, 
e.g. derrick, dunnock, haddock, hammock, 
hummock, slummock 


word-initially, only in the Greek letter- 
name xi pronounced /ksat/; regular 
medially, e.g. buxom, maxim, next (for 
exceptions see below); also finally where 
the /s/ is part of the stem, e.g. box (only 
exception: aurochs) 


e.g. quick See <cq, cqu, qu, que> within 
the Oddities, below, and Notes 


(does not occur) 


6% in total 


- before <e, i, y>: only in baccy, biccy, recce /'reki:/ 
(short for reconnoitre), soccer, speccy, streptococci 
- where the next letter is not <e, i, y>: in about 

45 words mainly of Latin origin, namely acclaim, 
acclimatise, accolade, accommodate, accompany, 
accomplice, accomplish, accord, accost, account, 


<cch> 


<ch> 
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accoutrement, accredit, accrete, accrue, acculturate, 
accumulate, accurate, accursed, accuse, accustom, 
desiccate, occasion, occlude, occult, occupy, occur, 
Succour, succubus, succulent, succumb. 

Words of non-Latin origin in this group are broccoli, 
buccaneer, ecclesiastic, felucca, hiccup, mecca, 
moccasin, peccadillo, peccary, piccolo, raccoon, 
Scirocco, staccato, stucco, tobacco, toccata, Wicca, 
yucca 

See Notes for the complementary value of <cc> 
before <e, i, y>, and on why <cc> is not the 
doubled spelling of /k/ 


only in bacchanal, Bacchante, bacchic, ecchymosis, 
gnocchi, saccharide, saccharine, zucchini. \n 
bacchanal, Bacchante, ecchymosis, saccharide, 
saccharine, the <h> could be deleted without 
altering the English pronunciation - see just above; 
but in bacchic, gnocchi, zucchini this change might 
make them look as if they were pronounced with 
/ks/ 


mainly in words of Greek origin, e.g. amphibrach, 
anarchy, anchor, archaic and every other word 
beginning /a:k/ (except arc, ark), brachial, 
brachycephalic, bronchi(-al/ tis), catechis-e/m, 
chalcedony, chameleon, chaos, character, charisma, 
chasm, chemical, chemist, chiasma, chimera, 
chiropody (also pronounced with initial /f/), 
chlamydia, chloride, chlorine, choir, cholesterol, 
cholera, choral, chord, choreograph(-er/y), chorus, 
chrism, Christian, Christmas, Chris(tophen, 
chrome, chromosome, chronic and every other word 
beginning /kron-/, chrysalis, chrysanth(emum), 
chyle, chyme, cochlea, diptych, distich, drachma, 
echo, epoch, eschatology, eucharist, eunuch, 
hierarch(y) and every other polysyllabic non- 
compound word ending /a:k(i:)/ (except aardvark), 
hypochondriac, ichor, lichen pronounced /'latkan/ 
(also pronounced /'‘I1t{an/), machination, malachite, 
mechani-c/sm, melanchol-y/ic, monarch(-y/ic), 
ochlocracy, ochre, orchestra, orchid, pachyderm, 
parochial, pentateuch, psyche and all its derivatives, 
scheme, schizo and all its derivatives, scholar, 
scholastic, school, stochastic, stomach, strychnine, 


50 Dictionary of the British English Spelling System 


<cq> 


<cqu> 


<cu> 


<g> 


<gh> 
<ke> 


<kh> 


<kk> 


<qu> 


synchronise, synecdoche, technical, technique, 
trachea, triptych, trochee. 

Words of non-Greek origin in this group are ache, 
aurochs, baldachin, chianti, chiaroscuro, cromlech, 
Czech, lachrymose, masochist, Michael, mocha, 
oche, pinochle, pulchritude, scherzo, schooner, 
sepulchre; also broch, loch, pibroch, Sassenach 
when pronounced with /k/ rather than Scots /x/ 
(for this symbol see section 2.3) 


only in acquaint, acquiesce, acquire, acquisitive, 
acquit 


spells only /k/ (not /kw/) only in lacquer, picquet, 
racquet 


only in biscuit, circuit (contrast ‘circuitous’ where the 
<u> ‘surfaces’ - see section 7.2.2); cf. <bu> under 
/b/, section 3.5.1, and <gu> under /g/, section 3.5.3 


only in length, lengthen, strength, strengthen 
pronounced /lenké, ‘lenk@an, strenk®, 'strenkOan/ 
(for their alternative pronunciations see under /n/, 
section 3.5.5) - for the rationale of this analysis 
see Notes under /n/, section 3.8.2 - and in angst 
/enkst/, disguise /dis'katz/, disgust pronounced 
/dts'kast/, i.e. identically to discussed; disguise, 
disgust are also pronounced /diz'gatiz, di1z'gast/, 
i.e. with both medial consonants voiced rather than 
voiceless 


only in hough 
only in Berkeley, burke 


only in astrakhan, gurkha, gymkhana, khaki, khan, 
khazi, khedive, sheikh, Sikh 


only in chukker, dekko, pukka and inflected forms 
of trek, e.g. trekkie 


as a digraph spelling only /k/ (not /kw/) occurs 
initially or medially (never finally - cf. next 
paragraph) in about 50 words mainly of French 
origin, namely bouquet, conquer (/w/ surfaces 

in conquest - see section 7.2), coquette, croquet, 
croquette, etiquette, exchequer, liqueur, liquor, 
liquorice, maquis, mannequin, marquee, marquetry, 
masquerade, mosquito, parquet, piquant, quatrefoil, 
quay, quenelle, quiche, so(u) briquet, tourniquet, and, 
in conservative RP-speakers’ accents, questionnaire, 
quoits; also medially in applique, communique, 
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manque, risque - see next paragraph; also 
phonemically but not orthographically word- 

final in opaque; claque, plaque; basque, casque, 
masque; antique, bezique, boutique, clique, critique, 
mystique, oblique, physique, pique, technique, 
unique; bisque, odalisque; toque; peruque; brusque 
pronounced /bru:sk/, and a few more rare words 
where the final written <e> is part of a split digraph 
with a preceding vowel letter spelling variously /e1, 
az, it, au, ur/. The words basque, casque, masque, 
bisque, odalisque and brusque pronounced /brursk/, 
where there is also an <s> before the <qu>, cause 
a special extension to the definition of a split 
digraph - see section A.6 in Appendix A and the 
Notes under <a.e, i.e, u.e>, sections 10.4/24/38 


<que> as occurs word-initially only in queue and medially 

a trigraph only in milquetoast (where it is nevertheless stem- 

spelling only final in a compound word); otherwise only word- 

/k/ (not /kw/_ finally and only in about 18 words mainly of French 

plus vowel) origin, namely: 
(1) with a preceding consonant letter such that 
<que> could be replaced by <k> without changing 
the pronunciation: arabesque, barque, basque, 
brusque pronounced /brask/ (also pronounced 
/bru:sk/), burlesque, casque, catafalque, grotesque, 
marque, masque, mosque, torque and the derived 
forms picturesque, romanesque, statuesque. 
However, in this group barque, basque, casque, 
marque, masque, torque are kept visually distinct 
from bark, bask, cask, mark, mask, torc 
(2) with a preceding vowel letter such that <que> 
could be replaced by <ck> without changing the 
pronunciation: baroque, cheque (cf. US check), 
monocoque, plaque pronounced /plek/ (also 
pronounced /pla:k/) 


<x> spells only in coxswain and before /s/ spelt <c> ina 
/k/ (not /ks/, small group of words of Latin origin, namely exceed, 
etc.) excellent), except, excerpt, excess, excise, excite 
Other 2-phoneme See also Notes 
graphemes 
/kf/ 
(1) only in flexure, luxury, sexual /'flekfa, ‘Iakfari:, 


spelt <x> 'sekf(urw)al/ 
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(2) only in anxious, complexion, connexion (also spelt 
spelt <xi> connection), crucifixion, fluxion, (ob)noxious 

/ks/ 

(1) only in annexe, axe, deluxe, (River) Exe. The <e> 
spelt <xe> in axe is redundant, as the US spelling ax shows 


(but cf. the ‘Three-Letter Rule’, section 4.3.2). The 
<e> in annexe /'zeneks/ (‘addition to building or 
document’) is also phonologically redundant (and 
mainly omitted in US spelling) but, where used, 
differentiates this word visually from annex /a'neks/ 
(‘take over territory’). Similarly, deleting the final 
<e> from the French spelling of de luxe would get 
too close to soap and washing powder 


(2) only in exhibition, exhortation, exhumation - for 
spelt <xh> exhibit, exhort, exhume see under /g/, section 3.5.3 


3-phoneme /eks/ only in X-ray, etc. One of only two 3-phoneme 
grapheme spelt <x> graphemes in the whole language 
NOTES 


For adverbs with the unstressed ending /r1kli:/ spelt <-ically> see section 6.10. 
It is unphonological but true that it is easier to state the main 

correspondences of /k/ in terms of following letters rather than following 

phonemes. (For an attempt to do it phonologically see Carney, 1994: 217). 
<k> is used to spell /k/ mainly before the letters <e, i, y>, that is, just 

where <c> would usually spell /s/ - see below. There are very few exceptions: 

1) where /k/ is spelt <c> despite being before <e, i>: Celt, Celtic, sceptic, 
all of which have alternative spellings with <k> (and the Glasgow 
football club is in any case /'selttk/), arced, arcing, synced, syncing 
(which means that the spelling synch for this verb is better); also several 
words beginning encephal.-, all of which have two pronunciations, with 
/s/ (where the spelling with <c> is regular) or /k/ (where it is irregular), 
e.g. encephalitis /en'sefalaitas, en'kefalaitas/ - note too the alternation 
between /n, n/ in the first syllable. Also, in July 2006 the derived form 
chicest /‘fitkist/ appeared on a magazine cover, and in May 2010 ad 
hocery appeared in the Guardian. There seem to be no exceptions with 
/k/spelt <c> before <y> 

2) where /k/ is spelt <k> despite not being before <e, i, y>: alkali, askance, 
blitzkrieg, hokum, kale, kangaroo, kaolin, kappa, kapok, kaput, klaxon, 
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kleptomaniac, koala, kohl, kopek, Koran, kosher, krypton, kwashiorkor 

(twice), leukaemia, mazurka, oakum, okay, okra, paprika, polka, sauerkraut, 

shako, skate, skulk, skull, skunk, sudoku, tektite, ukulele. 
<k> is also the regular spelling of /k/ at the end of one-syllable words where 
the preceding phoneme is NOT a short vowel, e.g. bark (only exception: 
burke); also in two-syllable words after a consonant letter and before final 
/al/ spelt <-le> - but there are very few words in this set, namely ankle, 
crinkle, rankle, sparkle, sprinkle, tinkle, twinkle, winkle, wrinkle (exceptions: 
circle, uncle). 

<q> is almost always used to spell /k/ when followed by /w/, which in this 
context is always spelt <u>. For exceptional spellings of /kw/, see /w/, section 
3.8.7. 

In addition to its single-grapheme spelling, <x>, /ks/ has several 
2-grapheme spellings. There are none in initial position (where /ks/ perhaps 
occurs only in xianyway). Word-finally, where the /s/ is not part of the stem the 
regular spelling in one-syllable words after a short vowel is <cks>, and <ks> 
in other one-syllable words; <cs> is regular in polysyllables - for exceptions 
to all of these see Table 3.3. Medially, there are three 2-grapheme spellings of 
/ks/: <xc> is rare - see the last entry among the Oddities above; <cs> is even 
rarer - it seems to occur only in ecstasy, ecstatic, facsimile, frolicsome, tocsin 
(contrast toxin); <cc> occurs in a few words mainly of Latin or French origin 
where the following letter is <e, i, y>, namely accede, accelerate, accent, accept, 
access, accident, accidie, coccyx, eccentric, flaccid pronounced /'flaks1d/ (also 
pronounced /'flazs1d/), occident, occiput, Occitan(e), succeed, success, succinct 
pronounced /sak's1nkt/ (also pronounced /sa'sinkt/), vaccine. 

It is because <c> before <e, i, y> almost always spells /s/ that <cc> can 
not function as the doubled spelling of /k/ - before a suffix beginning with <e, 
i, y> the second <c> would represent /s/ (as in the group of words just listed). 
So when a suffix beginning with a vowel letter is added to words ending in /k/ 
spelt <c>, the <c> is usually doubled to <ck>, e.g. bivouacked, picnicking, 
trafficked - but the principle of avoiding <c> spelling /k/ before <e, i, y> is 
not applied to arced, arcing, chicest (cf. above and section 4.2). <cc> also has 
the very rare pronunciation /t{/ only in bocce, cappuccino (see next section). 

/kf/ has scarcely any 2-grapheme spellings, but cf. baksheesh. 

The word ache is one of the few where the split grapheme <a.e> has two 
consonant letters in its midst (it could equally well be spelt “ake). 

As Carney says (p.216), ‘/k/ is the most divergent of the consonants’. For 
this reason, a further analysis of the major spellings of /k/ is given in Table 3.3. 


54 Dictionary of the British English Spelling System 


TABLE 3.3: THE DISTRIBUTION OF <c, ck, k, x> IN SPELLINGS OF /k, ks/. 


In each main box below, the regular spelling is stated at the top or above 


the relevant set of words. (For exceptions besides those in the Table, see 


the 2- and 3-phoneme graphemes and Oddities above). 


kagoul (also 
spelt cagoule), 
kale, kangaroo, 
kaolin, kappa, 
kapok, kaput, 
kayak, klaxon, 
klepto-maniac, 
koala, kohl, 
kopek, Koran, 
kosher, 
krypton 


spelt with one letter and final 
/al/ spelt <-le>, e.g. crackle 
(see section 4.3.3), plus beckon, 
buckaroo, buckshee, chickadee, 
cockatoo, cockaigne, cockatrice, 
cockney, gecko, hackney, hickory- 
dickory, huckster, jackanapes, 
lackadaisical, reckon, rucksack, 
sackbut 

With <k>: in two-syllable words 
after a consonant letter and 
before final /al/ spelt <-le>, 
namely ankle, crinkle, rankle, 
sparkle, sprinkle, tinkle, twinkle, 
winkle, wrinkle (exceptions: 
circle, uncle), plus alkali, askance, 
blitzkrieg, hokum, leukaemia, 
mazurka, oakum, okay, okra, 
paprika, periwinkle, polka, 
sauerkraut, shako, skate, skulk, 
Skull, skunk, sudoku, 


initial medial final 
/k/ <k>, e.g. kelp, | <k>, e.g. sketch, skit, sky <k> 
before kit, kyle Exceptions: sceptic; lichen Occurs only within split 
<e,i, y>| Exceptions: pronounced /‘latkan/ (also digraphs, e.g. cake, eke, bike, 
Celt, Celtic pro-nounced /'l1t{fan/); chicken, poke, rebuke, fluke, tyke 
cricket, jacket, mackerel, Exception: ache 
pernickety, pickerel, pocket, 
rocket, sprocket; mackintosh; 
finicky 
Words ending in <-c> usually 
add <k> when a suffix beginning 
with a vowel letter is added, e.g. 
bivouacked, picnicking, trafficked 
(but arced, arcing, chicest do not) 
/k/ not | <c>, e.g. cake, | <c>, e.g. scale, eclogue, scorch, In one-syllable words after 
before close, coal, across, acute a short vowel spelt with one 
<e,i, y>| cross, cute Exceptions: letter: <ck>, e.g. back, beck, 
Exceptions: With <ck>: between short vowel trick, clock, duck 


Exceptions: bloc, choc, doc, 
hic, mac, roc, sac, Sic, spec, 
tec, tic; flak, suk, trek, yak 
In other one-syllable words: 
<k>, e.g. ark, bank, brook, 
freak 

Exceptions: arc, chic, disc, 
franc, orc, talc, torc, torque, 
zinc 

In polysyllables: <c>, e.g. 
politic 

Exceptions: 

With <ck>: alack, attack, 
bailiwick, bannock, barrack, 
bollock, bullock, burdock, 
buttock, cassock, Cossack, 
derrick, dunnock, fetlock, 
fossick, gimcrack, gimmick, 
haddock, hammock, hassock, 
haversack, hemlock, hillock, 
hollyhock, hummock, limerick, 
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tektite, ukase, ukulele 

With <x> spelling only /k/ 
because the following <c, sw> 
spells /s/: exceed, excel, excellent, 
except, excerpt, excess, excise, 
excite; coxswain 


mattock, maverick, niblick, 
paddock, pollack, ransack, 
rollick, rollock, rowlock, 
rucksack, shamrock, 
slummock, tussock, warlock 
With <k>: aardvark, asterisk, 
basilisk, batik, bergomask, 
berserk, Bolshevik, bulwark, 
damask, kopek, mountebank, 
muzak, obelisk, Slovak, 
Sputnik, springbok, tamarisk, 
tomahawk, yashmak 


/ks/ Very rare - <xX>, e.g. maxim, next, toxin 


perhaps occurs | Exceptions: 
only in Greek |- where <x> occurs but spells 
letter name xi | only /k/ (see box above): exceed, 
pro-nounced 


/ksat/ 


excel, excellent, except, excerpt, 
excess, excise, excite; coxswain; 
- others: accede, accelerate, 
accent, accept, access, accident, 
accidie, coccyx, eccentric, flaccid, 
occident, occiput, Occitan, 
Succeed, success, succinct, 
vaccine; ecstasy, ecstatic, 
facsimile, frolicsome, tocsin 


Where /s/ is part of stem: 
<x>, e.g. fax, perplex, six, box, 
influx, pyx 

Only exception: aurochs 
Where /s/ is not part of stem, 
= is a suffix: all the non- 
suffixed forms of such words 
belong in the two boxes above, 
and in all these cases, when 
suffixed, /ks/ is spelt with the 
non-suffixed grapheme plus 
<s> 


3.7.2 /t{/ as in chew 


THE MAIN SYSTEM 


For all these categories see Notes 


regular initially except before /u:/, e.g. chin, 
church, also finally (except in one-syllable 


words after a short vowel spelt with one letter), 


e.g. church (exceptions: despatch, dispatch, 


eldritch); rare medially, but cf. bachelor, 


Basic <ch> 65% 
grapheme 

duchess 
Other <t> 25% 
frequent 


grapheme 


tune; never word-final 


regular medially, e.g. actual, intuition; also 
initially before /ur/ spelt <u, u.e>, e.g. tulip, 


56 Dictionary of the British English Spelling System 


Doubled <tch> 10% word-initially, only in Tchaikovsky, rare 

spelling medially - does NOT occur before final /al/ 
spelt <-le> - but there are a few examples, 
e.g. butcher, crotchet, hatchet, ketchup, 
kitchen, patchouli, pitcher, ratchet, satchel, 
(e)scutcheon, tetchy, wretched; regular in 
word-final position in one-syllable words 
after a short vowel spelt with one letter, e.g. 
match, exceptions: much, rich, such, which, 
niche pronounced /nit{/, kitsch, putsch; also 
(irregularly after a diphthong/long vowel) in 
aitch, retch 


THE REST 
Doubled spelling + <e> (does not occur) 
Oddities <1% in total 
<c> only in cellist, cello, cicerone (twice), concerto (second 
<c>) 


<cc> only in bocce, cappuccino 


<che> only in niche pronounced /ntt{/, which could be spelt 
“nitch; niche is also pronounced /ni:f/ 


<ci> only in ancient, ciabatta 

<cz> only in czardas /'t{a:def/, Czech /tfek/ 
<te> only in righteous 

<th> only in posthumous 


<ti> only in con/di/indi/in/sug-gestion, question, 
rumbustious and the derived forms combustion, 
exhaustion. In words like nation, lotion, equation | 
count the <i> as part of a digraph with the preceding 
consonant letter - see /J, 3/, sections 3.8.3-4 


<tsch> only in kitsch, putsch 


2-phoneme graphemes (none) 


NOTES 


Because /tf{/ is a sibilant consonant, adding any of the suffixes regular 
noun plural and third person singular person tense verb (spelt <es> - for 
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an exception see next sentence) and regular singular and irregular plural 
possessive (spelt <’s>) to a stem ending in /t{/ adds a syllable /1z/ as well 
as a morpheme: matches, detaches, (the) Church’s (mission). The only word 
ending in /t{/ where the stem already ends in <e> appears to be niche 
pronounced /nitf{/; in this case the ending is just <s>. See also /z/, section 
3.7.8, and /1/, section 5.4.3. 
The regular spellings of /t{/ are: 
initially, <t> before /u:/, otherwise <ch> 
medially, <t> 
finally, <tch> in one-syllable words after a short vowel spelt with one 
letter, otherwise <ch>. 
Examples: 
initial <t> before /ur/: tuba, tube, tuber, Tuesday /'t{u:zdi:/, tuition 
/tf{u:'wifan/, tulip, tumour, tumult(uous), tumulus, tuna, tune 
pronounced /'t{furn/, tunic, tureen, tutor 
initial <ch> otherwise: chin, church 
medial <t>: 
a small set of words ending in /tfan, tfas/ spelt <-tian, -tion, -tious>: 
Christian, combustion, con/di/indi/in/sug-gestion, exhaustion, question, 
rumbustious 
many nouns ending in /tfa/, which is mostly spelt <-ture>, e.g. 
adventure, capture, creature, culture, picture 
a set of adjectives ending in /t{u:was/, which are all spelt <-tuous>, 
e.g. tortuous, virtuous 
a small set of nouns in <-tuary>: actuary, estuary, mortuary, obituary, 
sanctuary, statuary, voluptuary, whether pronounced with /tf{urwari:/ 
or /tfari:/. For the elision of the <u> see section 6.10 
a larger set of adjectives in <-tual>: accentual, actual, conceptual, 
contractual, effectual, eventual, factual, habitual, intellectual, mutual, 
perpetual, punctual, ritual, spiritual, textual, virtual, etc., whether 
pronounced with /tfu:weal/ or /tfal/. For the elision of the <u> see 
again section 6.10. The elision seems even more prevalent in the 
derived adverbs in <-tually> 
a small set of words where /t{/ is spelt <t> and a following medial 
/a/ (always in the penultimate syllable, with the stress on the 
antepenultimate syllable) is spelt <u>: century, congratulate, fistula, 
flatulen-ce/t, fortunate, petulan-t/ce, postulant, postulate, saturate, 
spatula, titular 
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a ragbag of words with /t{/ spelt <t> followed by /u:/ spelt <u, 
u.e> (occasionally in word-final position as <ue>), e.g. impromptu; 
gargantuan, perpetuate; attitude, multitude, solitude; habitué; statue, 
virtue; intuition, pituitary, costume; fortune, importune, opportune; 
virtuoso; obtuse; de/in/pro/sub-stitute, de/in/pro/re/sub-stitution 
just two words where the stress falls on the syllable following /t{/ 
spelt <t> and that syllable contains <ur(e)> spelling /ua/: centurion, 
mature. In all the other words with medial /t/ spelt /t{/ listed above 
the stress falls on an earlier syllable. 
final <tch> after a short vowel spelt with one letter in monosyllables: 
match, sketch, pitch, botch, hutch, butch 
final <ch> otherwise: attach, arch, church. 
Exceptions (other than those listed under Oddities): 
initial <t> other than before /u:/: none 
initial <ch> before /ur/: only chew, choose 
initial spellings other than <t, ch>: <tch> only in Tchaikovsky 
medial spellings other than <t>: only archer, bachelor, cochineal, 
duchess, duchy, lecher, lichen pronounced /'l1tfan/ (also pronounced 
/‘latkan/), macho, treacher-y/ ous, butcher, crotchet, hatchet, ketchup, 
kitchen, patchouli, pitcher, ratchet, satchel, (e)scutcheon, tetchy, 
wretched as stem words, but there are also many derived forms, 
e.g. lurcher, marcher, matching, preacher, righteous, (re)searcher, 
teacher, also the words in <ti> listed in the Oddities 
final <tch> in monosyllables after a diphthong or long vowel: only 
aitch, retch pronounced /ri:t{/ (also pronounced /ret{/, where <tch> 
is regular) 
final <tch> in polysyllables: only despatch, dispatch, eldritch 
final <ch> in monosyllables after a short vowel: only much, rich, such, 
which 
final spellings other than <tch, ch>: see Oddities. 
As a spelling of /t{/, <ti> is rare and occurs only at the beginning of the final 
syllable of a stem word and immediately after a stressed syllable ending in 
/s/ spelt <s>, e.g. question. 

All the words in which /t{/ is spelt <t> were formerly pronounced with 
the sequence /tj/, and conservative RP-speakers may still pronounce them 
that way (or imagine that they do). Pronunciations with /tj/ would require an 
analysis with the /t/ spelt <t> and the /j/-glide either spelt <i> (where that 
is the next letter) or subsumed into the spelling of a 2-phoneme sequence 
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with the following vowel. However, | think that in current RP the process of 
affricating /tj/ to /t{/ is virtually complete (as Cruttenden, 2014: 83 says), 
and has eliminated pronunciations with /tj/, which | have therefore ignored. 
An English friend who once did a year’s teaching exchange in a primary 
school in the United States would ask the pupils every day, ‘What day is it?’ 
and on Mondays, Wednesdays, Thursdays and Fridays they would answer. 
But on Tuesdays they would insist she give the name of the day, and would 
then delightedly point out, ‘You say Choozdee /'t{urzdi:/!’ (as opposed to 
their /'tu:zdi:/ ‘Toozdee’, where the /j/-glide has been dropped without 
affricating the /t/, or perhaps was never present). 

For the parallel affrication of /dj/to /d3/ see section 3.7.4, and see also 
section 5.4.7. 


3.7.3 /f/ as in few 


THE MAIN SYSTEM 

Basic <f> 84% e.g. fish 

grapheme 

Other <ph> 11% in many words of Greek origin, e.g. philosophy. 
frequent See Notes 

grapheme 

Doubled <ff> 4% regular in word-final position in one-syllable 
spelling words after /a:/ spelt <a>, e.g. staff, and 


after a short vowel spelt with one letter, e.g. 
gaff, cliff, off, gruff, cf. section 4.3.5 and /a:/, 
section 5.5.1; for off see also section 4.3.2 
(exceptions: graph, gaffe, chef, clef, if, strafe 
is not an exception because it has /a:/ spelt 
<a.e>); also regular medially before final /al/ 
spelt <-le> after a short vowel spelt with one 
letter, e.g. duffle - see section 4.3.3; there 

are also some independent medial examples, 
e.g. affray, buffet (both pronunciations and 
meanings), chiffon, offer, proffer, ruffian, soffit, 
suffer - see section 4.3.5. Also see Notes 
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THE REST 

Doubled <ffe> only in gaffe, giraffe, pouffe; also in usual pronunciation of 

spelling + <e> difference, different, sufferance - for the elided vowel see 
section 6.10 (but not in afferent, efferent) 

Oddities 1% in total 


<fe> only in carafe, housewife (‘sewing kit’ pronounced /'hazif/) 
and, in rapid speech, conference, deference, preference, 
reference - for the elided vowel see section 6.10 


<ft> only in often, soften pronounced /'pfan, 'sofan/ 


<gh> medially, only in draught and derived forms of the following 
words; otherwise only word-final and only in chough, cough, 
enough, laugh, rough, slough (‘shed skin’), sough, tough, 
trough 


<pph> only in sapphic, sapphire, Sappho /'seftk, 'sefata, 'sefau/ 
<v> only in kvetch, svelte, svengali, veldt 


2-phoneme (none) 
graphemes 


NOTES 


In monosyllables, the default spelling is <f>, except where <ff> is regular 
as defined above. There are a few exceptions with the Greek <ph> spelling: 
graph, lymph, morph, phase, phone, phrase. 
In polysyllabic stem words, there are the following tendencies in the 
distribution of the three main spellings of /f/: 
<ph> occurs almost solely in words of Greek origin, and <f, ff> in 
other words - but how (unless you have studied Classical Greek, as | did, 
or know modern Greek) can you tell which words are of Greek origin? 
Though few people could answer this explicitly, many internalise the 
word-elements which have that origin and require the <ph> spelling, 
e.g. graph, lymph, morph, phag, phall, pharmac, pharyng, phase, 
pheno, phleb, phil(e/o), phob(e), phon(e/ic/o-), phor(e), phosph, photo, 
phrase, phren, phyll, phys, phyt(e/o), soph, spher, taph. As Carney 
(1994: 229) points out, there is further guidance towards <ph> in 
polysyllabic words if other Greek word-elements are present, e.g. 
anthrop, apo, chloro, chron, crypto, dia, dys, epi, eu, geo, hiero, hydro, 
hyper, hypo, lexi, macro, meta, micro, oid, ology, ortho, peri, scop, syn, 
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tele, thus aiding correct spelling of, e.g., apocrypha(l, chlorophyll, 
cryptographer, diaphragm, euphemism, euphoria, hieroglyphics, 
metaphor, psephology. 
Words with the element /fant/ were all once spelt with <ph> and 
most still are (e.g. phantasm, phantom) but, awkwardly, three of the 
commonest are now spelt with <f>: fantasise, fantastic, fantasy. 
The Greek elements neo, para may be misleading; they have taken 
on lives of their own in modern English (e.g. neocon, paramedic, 
where the second element in each word has a Latin origin) and might 
therefore mislead writers into, e.g., “neofite, “paraprophessional. 
Beyond this, one can only list some of the commoner of the other 
words containing <ph>: alpha, aphid, aphrodisiac, asphyxiate, 
blaspheme, cenotaph, dolphin, elephant, hyphen, lymph, nymph, 
orphan, phalanx, pheasant, phenol, phial, philistine, phloem, phoenix, 
siphon/syphon, sphinx, sylph, trophy, zephyr, and four words where 
the pronunciation varies between /f/ and /p/: diphtheria, diphthonag, 
naphtha, ophthalmic. Words of non-Greek origin in this set are: 
caliph, cipher/cypher, nephew (also pronounced with /v/), pamphlet, 
pharaoh, Pharisee, phwoar!, samphire, seraph, triumph. 

Nothing but a source of confusion would be lost if all these words with 

/f/ spelt <ph> were instead spelt with <f>, as the cognate words are 

in Italian and Spanish. 

<ff> 

1) For 2-syllable words ending in /al/ spelt <-le> and with a short 
vowel spelt with one letter in the first syllable, see above. 

2) There is a strong tendency for /f/ to be written <ff> in the middle 
of two-syllable words where the immediately preceding vowel 
phoneme is short and written with a single letter, e.g. offer. For 
examples and exceptions, see section 4.3.5. 

3) There is also a strong tendency for /f/ to be written <ff> rather 
than <f> at the end of the third from last syllable of a word /f/ 
is an exception to a wider rule in this respect. For examples and 
exceptions, see section 4.4.5. 

4) Word-finally in polysyllables not of Greek origin <ff> predominates: 
bailiff, caitiff, chiffchaff, dandruff, distaff, handcuff, mastiff, 
midriff, plaintiff, pontiff, rebuff, riffraff, sheriff, tariff, plus 
fisticuffs; contrast belief, (hand)kerchief, mischief, relief, caliph, 
seraph, triumph. 
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5) In the few remaining words not of Greek origin, <ff> again 
predominates: affidavit, affiliate, affinity, effeminate, efficacious, 
effrontery, paraffin, ragamuffin; contrast cafeteria, defeasible, 
deferential, defibrillate, defoliant, nefarious. 

<f> is the default spelling. The only generalisation, however, is that 

it is regular in consonant clusters in words not of Greek origin, e.g. 

afraid, after, deflate, deflect, defray, defrock, kaftan; exceptions 

include affray, effrontery. 


3.7.4 /&/ as in jaw 


THE MAIN SYSTEM 


For all these categories see also Notes. 


Basic grapheme <j> 29% never word-final; regular initially (e.g. jet), 
and medially when not followed by <e, 
i, y>, e.g. ajar, banjo, cajole, conjugal, 
enjoy, juju, major, (maha)rajah, sojourn. 
On this basis, the initial <j> in jujitsu is 
regular, but the medial one is not 


Other frequent <g> 51% never word-final (except Reg, veg); regular 
graphemes medially before <e, i, y> 


<ge> 10% word-initial only in geograph-er/y, 
geomet-er/ry, Geordie, George, Georgian), 
georgic; rare medially, but cf. burgeon, 
dungeon, gorgeous, hydrangea, pageant, 
pigeon, sergeant, sturgeon, surgeon, 
vengeance where the following /2a/ (or 
/t/ in pigeon) is spelt <a, 0, ou>; also 
dangerous, vegetable if <e> is elided 
— see section 6.10; also in the derived 
forms singeing, swingeing to prevent 
confusion with singing, swinging, and 
bingeing, spongeing, whingeing to avoid 
the misapprehension that there might be 
verbs to ‘bing, to “spong, to “whing (but 
fringing, impinging never retain the <e>); 
mostly word-final, e.g. binge, blancmange, 
change, disparage, flange, fringe, garage 
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Doubled <dge> 5% 
spellings (with 
<dg>) 
<dg> 
THE REST 


Doubled spelling + <e> 


Oddities 


pronounced /'gzrid3/, haemorrhage, 
hinge, image, language, lounge, mortgage, 
orange, impinge, scavenge, singe, sponge, 
village, whinge and hundreds of words 
ending in <-age> where the <e> is also 
part of the split digraph <a.e>, e.g. age, 
rage, stage. See section 7.1 for dual- 
functioning, section 10.4 for<a.e>, and 
section A.6 in Appendix A for the rarity of 
other split digraphs with included <g> 


never word-initial or medial; regular in 
word-final position in one-syllable words 
after a short vowel spelt with one letter, 
e.g. bridge, judge. See next paragraph, 
and section 6.4 on when <e>-deletion 
does and does not occur before suffixes 
beginning with a vowel letter 


never word-initial or -final; medially, 

does NOT occur before final /al/ spelt 
<-le>, and most occurrences arise 

from deleting <e> from <dge> before 
suffixes beginning with a vowel letter, 

e.g. bridging. However, there are a few 
words with independent medial /d3/ spelt 
<dg>: badger, budgerigar, budget, budgie, 
codger, cudgel, didgeridoo, dodgem, 
fidget, gadget, kedgeree, ledger, midget, 
podgy, smidgen, smidgin, todger, widget; 
also, as more obviously belonging to 

this set than to a set with medial <dge>, 
provided <eo> is recognised as a spelling 
of /a/ (see p.155), bludgeon, curmudgeon, 
dudgeon, gudgeon, smidgeon, widgeon 


(cannot occur because unsuffixed word-final 
doubled spelling already ends in <e>) 


5% in total 
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2-phoneme 
graphemes 


<ch> 


<d> 


<di> 


<dj> 


<gg> 


<gi> 


<jj> 


(none) 


only in ostrich, sandwich, spinach pronounced 
/‘ostr1d3, 'semwids, 'sprnidz/ 


never word-final; frequent initially and medially 
before /u:, 3, Ua/ spelt with various graphemes 
involving letter <u>, namely <eu, eur, ew, u, 

ua, ue, U.e, ur, Uure>, e.g. (initially) deuce (cf. the 
homophone juice), various words beginning with 
(Greek) deuter-; dew, due (which are homophones, 
and cf. the further homophone Jew); dual/ duel 
(cf. the homophone jewel), duet, duty, dune, dupe; 
durable, duration, duress, during; (medially) 
grandeur, arduous, assiduous, (in)credulous, 
deciduous, education, fraudulen-ce/t, graduate 
pronounced either /'grzedgu:wat/ (noun) or 
/‘gredgu:weit/ (verb), glandular, modul-e/ar, 
nodul-e/ar, pendulum, sedulous; gradual, 
individual, residual whether pronounced with 
/dgu:weal/ or /dgal/ (for the eliding of the <u> see 
section 6.10); endure, procedure, verdure (cf. the 
homophone verger). /r/-linking occurs in the 
derived forms endurance, procedural - see section 
3.6. See also Notes 


only in cordial pronounced /'k>:dgal/ (also 
pronounced /'ko:di:jal/), incendiary, intermediary, 
stipendiary, subsidiary pronounced with /-dgeari:/, 
soldier 


only in about 10 words of Latin origin: adjacent, 
adjective, adjoin, adjourn, adjudge, adjudicate, 
adjunct, adjure, adjust, adjutant, plus djinn 


only in arpeggio, exaggerate, loggia, suggest and 
the derived forms Reggie, veggie, vegging. The 
last three words appear to be the only examples 
of <gg> spelling /d3/ arising from consonant- 
doubling before a suffix - see section 4.2 


only in allegiance, contagio-n/us, egregious, legion, 
litigious, plagiaris-e/m, region, religio-n/us and the 
derived forms collegial, prestigious, vestigial 


only in hajj 
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NOTES 


The words geograph-er/y, geomet-er/ry could alternatively be analysed as 
having initial /d3/ spelt <g> and the following /a/ spelt <eo>, but this 
would entail a counter-intuitive analysis of Geordie, George, Georgia(n), 
georgic as having /3:/ spelt <eor>, so | have retained the analysis of 
geograph-er/y, geomet-er/ry as having initial /dj/ spelt <ge> and the 
following /a/ spelt <o>. 

Because /dg/ is a sibilant consonant, addition of any of the suffixes 
regular noun plural and third person singular person tense verb (both spelt 
<s> where the stem ends in <e>, otherwise <es>) and regular singular 
and irregular plural possessive (spelt <’s>) to a stem ending in /d3/ adds a 
syllable /1z/ as well as a morpheme: languages, sandwiches, (the) bridge’s 
(collapse). See also /z/, section 3.7.8, and /1/, section 5.4.3. 

To summarise, the regular spellings of /dg/ are: 

in word-initial position: <j> (73% of spellings in that position) 
in medial position before <e, i, y>: <g> 
in medial position otherwise: <j> 
in stem-final position in unsuffixed one-syllable words after a short 
vowel spelt with one letter: <dge> 
in stem-final position when <dge> loses the <e> before a suffix 
beginning with a vowel letter (see section 6.4): <dg> 
otherwise in word-final position (including dual-functioning of the 
<g> within split digraphs): <ge>. 

Exceptions (in addition to the Oddities): 
initial <g> (27% of spellings in that position): gaol, gee, gel (/del/ 
‘viscous liquid’; contrast ge! pronounced /gel/, ‘posh’ version of girl), 
gelatin, gelignite, gem, geminate, Gemini, Gemma, gen, gender, gene, 
general, generate, generic, generous, genial, genie, genital, genitive, 
genius, gent, gentle, genuflect, genuine, genus, most words beginning 
with (Greek) geo ‘earth’, e.g. geographic (but not those listed above as 
having initial /d3/ spelt <ge>), Geoff(rey), geranium, gerbil, geriatric, 
germ, German, gerrymander (also spelt with <j>), gerund, gestate, 
gesture, giant, gibber, gibbet, gibe, giblets, gigantic, gigolo, gill 
(/dgil/ ‘quarter of a pint’; contrast gill /g1l/ ‘lung of fish’), Gillingham 
/‘dgiltnam/ in Kent (but /'gtlinam/ in Dorset and Norfolk), gimbal(s) 
(also pronounced with /g/), gimcrack, gin, gingerly), ginseng, gipsy, 
giraffe, giro, gist, gym(nas-t/ium), gyp, gypsum, gypsy, gyrate, various 
words beginning with (Greek) gyro-, gyves 
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medial <j> before <e, i>: only in jujitsu, majest-y/ic and words with 

the Latin element <ject> (‘throw’), namely ab/de/e/in(tery)/ob/pro/re/ 

sub-ject, conjecture, trajectory (no exceptions before <y>) 

medial <g> not before <e, i, y>: only in margarine (also pronounced 

with /g/), second <g> in mortgagor 

medial <dg>: see list above 

medial <ge>: see list above 

final <g>: only in Reg, veg 

final <dge> in words of more than one syllable: only in abridge, 

cartridge, (ac)knowledge, partridge, porridge. 
All the words in which /d3/ is spelt <d> were formerly pronounced with 
the sequence /dj/, and conservative RP-speakers may still pronounce them 
that way (or imagine that they do). Pronunciations with /dj/ would require 
an analysis with the /d/ spelt <d> and the /j/-glide subsumed into the 
spelling of a 2-phoneme sequence with the following vowel. However, | 
think that in current RP the process of affricating /dj/ to /d3/ is virtually 
complete (as Cruttenden, 2014: 83 says) and has eliminated pronunciations 
with /dj/, which | have therefore ignored. 

For the parallel affrication of /tj/to /t{/ see section 3.7.2, and see also 
section 5.4.7. In the case of /t{/ there are very few spellings competing with 
<t> in initial and medial positions before /u:/, etc., and <t> is therefore the 
regular spelling. However, for initial and medial /d3/ there are many words 
spelt with <j> before /u:/, etc., so that <d> cannot be considered the 
regular spelling in these circumstances, or ‘promoted’ to the main system. 
This also means that words in which /d3/ is spelt <d> cannot be predicted 
and just have to be learnt. 

For the ending /1d3/ see also under /1/, section 5.4.3. 


3.7.5 /I/ as in law 


THE MAIN SYSTEM 


Basic grapheme <I> 75% e.g. lift 
Doubled spelling <II> 18% e.g. fill. See Notes 
Frequent 2-phoneme_ /al/ 8% e.g. dazzle, debacle, table, 


grapheme spelt <-le> visible. See Notes 
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THE REST 


Doubled spelling + <e> <lle> medially, only in decollet-age/e; otherwise 
only final. Regular in the ending -ville, e.g. 
vaudeville; also in bagatelle, belle, braille, 
chanterelle, espadrille, fontanelle, gazelle, 
grille, pastille, nacelle, quadrille (but not 
reveille, tagliatelle where the <e> spells 
/ix/). In chenille, tulle | analyse /|/ as spelt 
<Il> and <i.e, u.e> as split digraphs spelling 
/ixt, us/ - see sections 5.7.2, 5.7.6, A.6 - and 
medially in guillemot <\le> spells /li:/ 


Oddities <1% 


<gl> only in a few Italian loanwords, e.g. 
imbroglio, intaglio, seraglio, tagliatelle 


<le> except in Charles, only word-final and only 
in aisle, cagoule, clientele, gargoyle, joule, 
isle, lisle, voile. On isle, lisle see also /at/ 
spelt <is>, section 5.7.3 


<lh> only in philharmonic, silhouette 


Other 2-phoneme /al/ only word-final and only in axolotl, dirndl, 
graphemes spelt <I> shtetl 


/\j/ only in carillon 
spelt <II> 


NOTES 


At 18%, <II> has the highest frequency of all the doubled consonant 
spellings (at least in stem words, i.e. discounting consonant-doubling 
before suffixes): 
It occurs in the two exceptional word-initial doubled consonant 
spellings /lama, Ilano. 
It is regular in word-final position in one-syllable words after /5:/ 
spelt <a>, e.g. all, and after a short vowel spelt with one letter, e.g. 
Shall, ell, ill, moll, cull, bull. Exceptions: col, gal, gel (both /gel/, 
posh pronunciation of girl, and /dgel/ ‘lotion’), mil (abbreviation of 
millimetre), nil, pal; gal, pal would otherwise look identical to gall, 
pall and be pronounced with /9:/. For all, ell, ill see also section 4.3.2. 
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There is a strong tendency for /I/ to be spelt <Il> in the middle of 
two-syllable words where the immediately preceding vowel phoneme 
is short and written with a single letter. For examples and exceptions, 
see sections 4.3.4 and 4.4.6. 
There is also a preference for /I/ to be spelt <II> rather than <I> at 
the end of the third from last syllable of a word. /I/ is an exception to 
a wider rule in this respect. For examples and exceptions, see section 
4.4.5. 
There appear to be only a few polysyllabic non-compound words 
ending in <Il>: chlorophyll and some rare words in -phyll, idyll, 
plimsoll (also spelt plimsole). All other non-compound polysyllables 
end in <I>, except those listed above under <lle>. 
Similarly, there appear to be only three three-syllable words with 
<Il> at the end of the second syllable/beginning of the last syllable: 
embellish, intellect, parallel. 
In words of more than three syllables, no generalisation seems 
possible, so here is a list of such words which have <ll>: allegory (3 
syllables if the <o> is elided), alleviate, alligator, alliterate, various 
words beginning with (Greek) allo-, ballerina, calligraphy, camellia, 
collaborate, collateral, fallopian, hallelujah, hallucinate, hullabaloo, 
illegible, illegitimate, illiberal, illimitable, illiterate, illuminate, 
mellifluous. Alleviate, alliterate and all those just listed beginning 
<coll-> and <ill-> belong to the (to most people, meaningless) 
category of words with Latin roots and assimilated Latin prefixes - see 
section 4.3.1. 

On reducing <II> to <I> in compound words see section 4.4.7. 

In non-final positions, the 2-phoneme sequence /al/ only has straight 
(non-reversed) 2-grapheme spellings, e.g. allowed, aloud. But in final 
position, although the reversed spelling <-le> predominates, there is 
considerable variation between that and several non-reversed spellings. 
The situation is too complex to summarise at this point; see sections 4.3.3 
and 4.4.2-3. 

For 2-grapheme spellings of /Ij/ see under /j/, section 3.8.8. 

For <lle> in chancellery, jewellery see section 6.10. 
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3.7.6 /s/ as in sue 


THE MAIN SYSTEM 


For all these categories see Notes and Tables 3.4-5. 


Basic grapheme <s> 79% e.g. Sat, persuade, bias 

(with Regular 

<se,ss>) (1) initially and medially where the next 
letter is not <e, i, y>; 
(2) in most unstressed final syllables of 
polysyllables; 
(3) in various suffixes and contracted 
forms after a non-sibilant voiceless 
consonant; 
(4) within split digraph <o.e> 


Other frequent <c> 15% e.g. city, decide 
graphemes (with Regular initially and medially where 
<ce>) the next letter IS <e, i, y>. Never 
word-final 
<ce> e.g. fence, mice 


Except in a few suffixed forms (see 
section 6.4), only word-final, where 

it is regular after /n/ and when the 
<e> is also part of split digraphs <a.e, 
i.e, u.e, y.e>, €.g. ace, ice, puce, syce 
(for dual-functioning see section 

7.1, and for split digraphs section 

A.6 in Appendix A), but is otherwise 
unpredictable 


<se> only word-final, where it is regular 
after /I, p/, after <r> forming part of 
a vowel di-/tri-graph, and after most 
vowel digraphs 
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Doubled 
spelling 


Rare 
2-phoneme 
grapheme 


THE REST 


Doubled 
spelling + <e> 


Oddities 


<Ss> 


/ks/ 
spelt 
<xX> 


<sse> 


<cc> 


<ps> 


<sc> 


regular word-finally in one-syllable 
words after /a:/ spelt <a> and after 

a short vowel spelt with one letter, 

e.g. grass, fuss; also in suffixes <-ess, 


-less, -ness> and in stressed final 


syllables of polysyllables 


see under /k/, section 3.7.1. Though 
rare as a correspondence for /s/, this 
counts as part of the main system 
because of its higher frequency as a 
correspondence for /k/ 


except in divertissement, only word-final, e.g. 
bouillabaisse, crevasse, duchesse, finesse, 
fosse, impasse, lacrosse, largesse (also spelt 
largess), mousse, noblesse, palliasse, wrasse 
and a few more rare words 


6% in total 


only in flaccid, succinct pronounced 
/‘flasid, sa'sinkt/ (also pronounced with /ks/) 


only word-initial and only in some words 

of mainly Greek origin, e.g. psalm, psalter, 
psephology, pseud(o) and many compounds, 
psionic, psittacosis, psoriasis, psych(e/o) and 
many compounds, and a few more very rare 
words. /p/ surfaces in metempsychosis - see 
section 7.2 


only in abscess, abscissa, adolescen-t/ce, 
ascend, ascertain, ascetic, corpuscle, crescent 
(also pronounced with /z/), descend, discern, 
disciple, fascicle, fascinate, isosceles, lascivious, 
miscellany, muscle, (re)nascent, obscene, 
omniscient, oscillate, plebiscite, prescient, 
proboscis, proscenium, rescind, resuscitate, 
scenario, scene, scent, sceptre, sciatic(a), 
Science, scimitar, scintilla, scion, scissors, 


<sce> 


<sch> 


<st> 
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scythe, susceptible, transcend, viscera(l, viscid 
and a few more rare words, plus suffixed 
derivatives of next group. /k/ surfaces in 
corpuscular, muscular - see section 7.2 


only in stressed final syllables of verbs ending 
/es/ spelt <-esce>, e.g. acquiesce, coalesce, 
convalesce, deliquesce, effervesce, evanesce 
and some other very rare words, plus reminisce 


only in schism pronounced /'stzam/ 


only medial. 

Between a short vowel spelt with one letter 
and final /al/ spelt <-le>, <st> is the regular 
spelling of /s/, but this is a small set: pestle, 
trestle, bristle, Entwistle, epistle, gristle, thistle, 
Thistlethwaite, Twistleton, whistle, apostle, 
jostle, Postlethwaite, throstle, bustle, hustle, 
rustle and the derived forms nestle, wrestle 
(exceptions: hassle (but | once received an 
email with this word spelt “hastle, showing 
the power of this sub-rule), tassel, corpuscle, 
muscle, tussle) (/t/ surfaces in apostolic, 
castellan, castellated, epistolary - see section 
7.2); also, with a preceding long vowel (in 

RP), castle; also occurs before final /an/ 

spelt <-en>, but this is an even smaller set: 
glisten, listen and the derived form christen 
with preceding short vowels, fasten with a 
preceding long vowel (in RP), and the derived 
forms chasten, hasten, moisten with preceding 
long vowels or diphthongs. 

The only other examples of medial /s/ spelt 
<st> within stem words appear to be forecastle 
in either of its pronunciations /'fauksal, 
'foikarsal/, mistletoe, ostler. In nestle, wrestle, 
christen, chasten, hasten, moisten, fasten, 

/t/ has been lost at a morpheme boundary. 
Other examples of compounds with lost /t/ so 
that <st> spells /s/ are chestnut, Christmas, 
durstn’t, dustbin, dustman, mustn’t, waistcoat 
/‘wetskaut/ and sometimes ghastly. 
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Other 
2-phoneme 
graphemes 


3-phoneme 
grapheme 


<sth> 


<SwWw> 


<t> 


<Z> 


/ts/ spelt <z, zz> 


/ks/ spelt <xe> 


/ks/ spelt <xh> 


/eks/ spelt <x> 


This loss of /t/ at a morpheme boundary is one 
small aspect of a very frequent process which 
is too widespread and complicated to tackle in 
this book, focused as it mainly is on citation 
forms of stem words - see Appendix A, section 
A.l 


only in asthma, isthmus if pronounced without 


/8/ 


only in answer, coxswain, sword /'a:nsa, 
‘koksan, sd:d/ and boatswain pronounced 
/‘bausan/ (also pronounced /'bautswein/) 


only the penultimate <t> in about 10 words 
ending in <-tiation>, e.g. differentiation, 
initiation, negotiation, propitiation, 
transubstantiation, and only for RP-speakers 
who avoid having two occurrences of medial 
/J/ in such words (see Notes under /Jf/, section 
3.8.3). In French, on the other hand, <t> is one 
of the most frequent correspondences for /s/ 


only in blitz(krieg), chintz, ersatz, glitz, 
howitzer, kibbutz, kibitz, klutz, lutz, pretzel, 
quartz, ritz, schmaltz, schnitzel, seltzer, 
spritz(en), Switzerland, waltz, wurlitzer 


see under /t/, section 3.5.7 


only in annexe, axe. See comments under /k/, 
section 3.7.1 


only in exhibition, exhortation, exhumation 
- for exhibit, exhort, exhume see under /g/, 
section 3.5.3 


see under /k/, section 3.7.1 
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NOTES 


For the very few occasions when <ss> is reduced to <s> in compound 
words see section 4.4.7. 

/s/ is the phonological realisation of various grammatical suffixes 
(regular noun plural and third person singular person tense verb (both spelt 
<s> where the stem ends in <e>, otherwise <es>), regular singular and 
irregular plural possessive (spelt <’s>)), and of is, has when contracted 
(also spelt <’s>), after any voiceless non-sibilant consonant (/p t k f 8/). As 
just shown, in all these cases the spelling contains <s>. 

However, because /s/ itself is a sibilant consonant, adding any of the 
suffixes just listed to a stem ending in /s/ adds a syllable /1z/ as well as a 
morpheme: horses, fusses, Brooks’s. On this and the topic of the previous 
paragraph see also /z/, section 3.7.8, and /1/, section 5.4.3. 

Because /s/ is almost as divergent as /k/, a further analysis of the major 
spellings of /s/ is given in Tables 3.4-5. As with /k/, it is unphonological 
but true that it’s easier to state all the initial and medial correspondences 
of /s/, and some of the stem-final ones, in terms of following letters rather 
than following phonemes. (For an attempt to do it phonologically see 
Carney, 1994: 234-6). 

There are several words in <-se> in which the <e> appears redundant 
since the <s> alone would spell /s/ and the <e> is not part of a split 
digraph, namely carcase (also spelt carcass), purchase; mortise (also 
spelt mortice), practise, premise (also spelt premiss), promise, treatise 
(cf. thesis); purpose; porpoise, tortoise; apocalypse, apse, collapse, eclipse, 
elapse, ellipse, glimpse, prolapse, relapse, traipse. \n copse, corpse, lapse 
the <e> is equally redundant phonographically but serves to differentiate 
these words visually from cops, corps, laps. 

The virtual non-existence of <-oce> and the rarity of <-oss> as spellings 
for word-final /aus/ mean that <-ose> is almost entirely predictable as the 
spelling of this stem-word ending. However, this seems to be one of very 
few examples where a pattern of this sort is more reliable than predictions 
from the separate phonemes. On this see also section A.7 in Appendix A. 

For the medial /s/ in eczema see section 6.10. 
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TABLE 3.4: THE DISTRIBUTION OF <c, s, ss> IN INITIAL AND MEDIAL SPELLINGS OF 
/s/ OTHER THAN IN /ks/ (FOR /ks/ SEE ABOVE AND UNDER /k/, SECTION 3.7.1). 


In each main box below, the regular spelling is stated at the top in bold. 


For other exceptions see the 2- and 3-phoneme graphemes and 
Oddities above. 


Initial medial 
/s/ not <s>,e.g. sale, scale, skull, slime, <s>, e.g. descant, askance, asleep, 
before smooth, snake, soap, spill, still, suave, | dismiss, consonant, dyspnoea, disaster 
<e>, swede (second <s>), persuade, aswill 
<i>, <y> | Exceptions (all with <c>): caecum, Exceptions (there are none where the 
caesium, caesura, coelacanth, next letter is a consonant, but those 
coelenterate, coeliac, coelom, in capitals in this list are exceptions 
coenobite, coenocyte (there are none where the next phoneme is a 
where the next phoneme/ letter is a consonant): apercu, facade (lacking 
consonant) French cedillas); ambassador, assail, 
assassin (first <ss>), assault, assay, 
cassava, commissar, dissatisfy, essay, 
massacre, pessary, reconnaissance, 
renaissance; hassle, tussle; associate, 
assonance, assorted, bassoon, blossom, 
caisson, dissociate, dissolute, dissonant, 
lasso, lesson, lissom, voussoir, alyssum, 
ASSUAGE, assume pronounced 
/as'ju:m/, DISSUADE, also EMISSARY, 
NECESSARY, PROMISSORY with elided 
vowels (see section 6.10) 
/s/ <c>, e.g. ceiling, city, cyclic <c>, e.g. accept, decide, bicycle 
before Exceptions (all with <s>): sea, seal, Exceptions: 
<e>, seam, seance, sear, search, season, Words ending in /s1s/ are spelt <-sis>, 
<i>, <y> | seat, sebaceous, sebum, secant, e.g. Sis (abbreviation of sister), thesis 
Sec-ateurs, secede, seclude, second, (only exception: diocese) 
Secret(e), secretary, sect(ion), secular, | Words ending in /siti:/ preceded by a 
Secure, sedan, sedate, sedentary, consonant letter or by /p/ spelt <o> are 
sedge, sediment, sedition, seduce, spelt <-sity>, e.g. adversity, density, 
See, seed, seek, seem, seep, seethe, diversity, falsity, immensity, intensity, 
segment, segregate, segue, self, sell, pervers-ity, propensity, sparsity, 
semantic, semaphore, semblance, university, varsity, animosity, curiosity, 
semen, semi, seminar, semiotics, generosity, impetuosity, verbosity, 
semite, semolina, senate, send, senile, | virtuosity, viscosity (exceptions: 
senior, sense, sensual, sent-ence, scarcity, atrocity, ferocity, precocity, 
sentient, sentiment, sentinel, sentry, reciprocity, velocity, and cf. city itself) 
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sepal, separate, sept, septic, Words ending in /siv/: if preceded 
sepulchre, sequel, sequence, seques- by a short vowel spelt with one letter 
ter, sequin, seraph, serenade, serene, | they are spelt <-ssive> (e.g. massive), 
Serf, serge, sergeant, serial, series, otherwise <-sive> (e.g. adhesive) (but 
serious, sermon, serpent, serrated, N.B. sieve itself) 

serum, serve, Sesame, Session, set, Other exceptions: 

Settle, seven, sever, severe, sew, with <s>: abseil, absent, arsenal, 

sewer (in both pronun-ciations and arsenic, beseech, consecutive, consensus, 
meanings), sex; sibilant, sibling, consent, consequen-ce/t, corset, counsel, 
Sibyl, sick, side, sidereal, sidle, siege, | desecrate, disembark, dysentery, insect, 
Siesta, sieve, sift, sigh, sight, sign, morsel, prosecute, transept; basin, 
Sikh, silage, silent, silica, silk, sill, consist, disinfect, disinherit, misinform, 
Silly, silo, silt, silver, simian, similar, misinterpret, parasite, transit; asylum; 
simmer, simper, simple, simultan- apostasy, argosy,controversy, courtesy, 
eous, sin, since, sincere, sine, sinew, ecstasy, fantasy, greasy, heresy, 

sing, singe, single, sinister, sink, hypocrisy, idiosyncrasy, jealousy, 
sinuous, sip, siphon, sir, sire, siren, leprosy, minstrelsy, pleurisy, prophesy 
Sisal, sister, sit, sitar, site, situation, (verb, pronounced /'proftsar/ - the 

six, size, sizzle; sybarite, sycamore, noun prophecy, pronounced /'proftsi:/, 
syce, sycophant, syllable, syllogism, has regular <c>); autopsy, biopsy, 
sylph, symbiosis, symbol, symmetry, catalepsy, curtsy, dropsy, epilepsy, 
sympathy, symphony, symposium, gipsy, narcolepsy, necropsy, tipsy 
symptom, synaesthesia, synagogue, with <ss>: antimacassar, assegai, 
synapse, synch(ro-), synergy, synod, assemble, assent, assert, assess, asset, 
synonym, synopsis, syntax, synthesis, | casserole, cassette, connoisseur, cussed 
syphilis, syphon, syringe, syrup, (‘stubborn’), delicatessen, dissect, 
system, syzygy dissemble, disseminate, dissension, 


dissent, dissertation, disservice, essence, 
essential, fricassee, lessen, masseu-r/ 
se, mussel, necessity, trousseau; 
admissible, assassin (second <ss>), 
assiduous, assign, assimilate, assist, 
assize, brassica, bassinet, chassis, 
classic, classify, dissident, dissimilar, 
dissipate, fossil, gossip, jurassic, 
lassitude, messiah, permissible, (im) 
possible, potassium, prussic, triassic; 
embassy, hussy 
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TABLE 3.5: THE DISTRIBUTION OF <c, ce, s, se, ss> IN STEM-FINAL SPELLINGS OF /s/ 
OTHER THAN IN /ks/ (ALSO EXCLUDING GRAMMATICAL SUFFIXES). 


For /ks/ see above and under /k/, section 3.7.1. For /s/ as a grammatical 


suffix, see above. 


Categories listed 


in the left-hand column below apply to both 


monosyllables and polysyllables except where stated. 


For exceptions besides those in the Table, see the 2- and 3-phoneme 


graphemes and Oddities above. 


In mono-syllables after 
/a:/ spelt <a> and after a 
short vowel spelt with one 
letter: <ss> 


Examples: brass, class, glass, grass, pass; ass, bass (/bxs/ ‘fish’), 
crass, lass, mass; bless, cess, chess, cress, dress, guess, less, mess, 
ness, press, stress, tress; bliss, hiss, kiss, miss, pss; boss, cross, 
doss, dross, floss, loss, moss; buss, cuss, fuss, muss, truss; PUSS. 
Exceptions: gas, yes, Sis (abbreviation of sister), this, bus, plus, 
pus, thus, us 

Extension: There appears to be only one other one-syllable stem 
word in which a long vowel/diphthong is spelt with a single letter 
before word-final /s/: bass (/bets/ ‘(player of) large stringed 
instrument’ /‘(singer with) low-pitched voice’) 


After /n/: <ce> 


Examples: (monosyllables) dance, chance, glance, lance, prance, 
Stance, trance; fence, hence, pence, thence, whence; mince, 
prince, quince, since, wince; nonce, once, sconce; bounce, flounce, 
ounce, pounce, trounce; dunce; (polysyllables) abundance, 
evidence and hundreds of other words ending <-ance/-ence>, 
convince, evince, province, ensconce, an/deX mis) pro-nounce 
Exceptions: (monosyllables) manse; dense, sense, tense; rinse; 
(polysyllables) expanse; condense, dispense, expense, immense, 
incense (noun and verb), intense, license, nonsense, recompense, 
Suspense; response 


After /l, p/: <se> 


Examples: (monosyllables) dulse, else, false, pulse; apse, copse, 
corpse, glimpse, lapse, traipse; (polysyllables) convulse, impulse, 
repulse; apocalypse, eclipse, ellipse, col/e/pro/re-lapse 
Exceptions: (none) 


After <r> forming part 
of a vowel di-/tri-graph: 
<se> 


Examples: (monosyllables) arse, coarse, course, curse, Erse, 
gorse, hearse, hoarse, horse, morse, Norse, nurse, purse, sparse, 
terse, verse, worse; (polysyllables) adverse, averse, concourse, 
converse, discourse, disburse, disperse, diverse, endorse, 
immerse, intercourse, intersperse, inverse, obverse, recourse, 
rehearse, reimburse, remorse, reverse, traverse, transverse, 
universe 

Exceptions: (monosyllables) farce, fierce, force, pierce, scarce, 
source, tierce; (polysyllables) commerce, divorce, enforce, 
reinforce, perforce, resource 
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After other vowel 
digraphs: <se> 


Examples: (monosyllables) cease, crease, grease, lease, geese, 
goose, loose, moose, noose, douse, grouse, house (noun), louse, 
mouse; (polysyllables) decease, de/in-crease, release, porpoise, 
tortoise, caboose, papoose, vamoose 

Exceptions: (monosyllables) sauce, peace, fleece, deuce, niece, 
piece, choice, voice, juice, sluice; gneiss; (polysyllables) invoice, 
rejoice 


In words ending /aus/: 

<s> within split digraph 
<o.e> spelling /au/ (see 
also Notes above Table) 


Examples: (monosyllables) close (adjective/noun), dose; 
(polysyllables) the ‘sugar’ words dextrose, glucose, lactose, 
sucrose (all of which have alternative pronunciations in /auz/), 
and adjectives comatose, lachrymose, morose, verbose and 
dozens of others (see also section A.7 in Appendix A) 

Only exceptions: Groce (rare surname); gross, engross 


Where any other long 
vowel, diphthong or 
/ju:/ is spelt with a split 
digraph: <ce>, such that 
the <e> functions as 
part of both graphemes 
(for dual-functioning see 
section 7.1) 


Examples: (monosyllables) ace, brace, dace, face, lace, grace, 
mace, pace, place, race, space, trace; dice, ice, lice, mice, nice, 
price, rice, slice, spice, splice, thrice, trice, twice, vice; puce, 
spruce, truce, syce; (polysyllables) apace, de/ef/out/re-face, 
disgrace, embrace, en/inter-lace, dis/mis/re-place, retrace; 
advice, caprice, device, entice, police, sacrifice, suffice; 
ad/com/de/e/in(tro)Kre)pro/re/se/tra-duce, prepuce 
Exceptions: (monosyllables) base, case, chase; use (noun); 
(polysyllables) 

a/de-base, encase; obese; concise, paradise, precise; abstruse, 
obtuse, recluse; also merchandise, abuse, excuse, refuse as nouns 
and diffuse as an adjective 


In stressed final syllables 
of polysyllables: <ss> 


Examples: abyss, address, amiss, assess, caress, confess, discuss, 
dismiss, distress, duress, excess, express, impress, morass, 
possess, process (verb), profess, progress (verb), prowess, recess, 
redress, repress, remiss, success 

Exceptions: none (?) 


In polysyllables ending in 
unstressed /sis/: <s> 


Examples (from many that could be given): 
(anti/meta/syn-)thesis, catharsis 
Only exception: diocese 


In other cases of final 
unstressed /1s/ in 
polysyllables: <ce> 


Examples (from many that could be given): apprentice, auspice, 
chalice, justice, practice 

Exceptions: axis, cannabis, marquis, metropolis, pelvis; practise, 
premise, promise, treatise; premiss 

N.B.: mortice, mortise has both spellings, and cf. Latin (rigor) 


mortis (‘(stiffness) of death’) 
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TABLE 3.5: THE DISTRIBUTION OF <c, ce, s, se, ss> IN STEM-FINAL SPELLINGS OF /s/ 
OTHER THAN IN /ks/ (ALSO EXCLUDING GRAMMATICAL SUFFIXES), CONT. 


In other unstressed final 
syllables of polysyllables: 
<s> 


Examples (from many that could be given): bias, canvas, 
corpus, cosmos, fabulous, horrendous, rickets, syllabus, 
tonsilitis, virus 

Exceptions: 

<se> only in carcase, purchase, purpose 

<ss> in abscess, access, albatross, blunderbuss, buttress, 
canvass, Carcass, compass, congress, cutlass, egress, 
embarrass, empress, harass, ingress, isinglass, mattress, all 
the compounds of press, process (noun), progress (noun), 


trespass, windlass 


3.7.7 /V/ as in view 


THE MAIN SYSTEM 


Basic grapheme 


Other frequent 
grapheme 


Doubled spelling 


THE REST 


Doubled spelling + <e> 


<v> 98% e.g. oven 


<f> only in of, and roofs pronounced /ru:vz/ 
(neither counted in percentages) 


<ve> 2% regular in word-final position, e.g. 
give, have, positive. Exceptions: bruv, 
chav, derv, gov, guv, lav, leitmotiv, of, 
rev, satnav, shiv, Slav, sov, spiv,; <ve> 
also spells /v/ in average, deliverable, 
evening (noun, ‘late part of day’, 
pronounced /'i:vnin/, as distinct from 
the verb of the same spelling, ‘levelling’, 
pronounced /'i:vantn/), every, leverage, 
several, sovereign - cf. section 6.10 - but 
is very rare medially in stem words (but 
see Notes) and never occurs initially 


(cannot occur because doubled spelling 
already ends in <e>) 
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Oddities <1% in total 
<bv> only in obvious pronounced /'pvi:jas/ 
<ph> only in nephew pronounced /'nevju:/ (also 
pronounced /'nefju:/), Stephen (also spelt 
Steven) 
<w> only medial and only in bevvy, bovver, chavvy, 


chivvy, civvy, divvy, flivver, lavvy, luw-ie/y, 
navvy, revving, savvy, Skivvy, spivvery, spivvy 


2-phoneme graphemes (none) 


NOTES 


<w> is very rare (it occurs only in the words just listed, most of which are 
slang), and in word-final position <ve> functions in its place as the doubled 
spelling of /v/ (see next paragraph). 

English spelling has a tacit rule that words must not end in <v>, except 
for a few slang and foreign words and modern abbreviations (see above 
under ‘Doubled spelling’). Word-finally, therefore, the regular spelling of 
/v/ is <ve>, which occurs in at least 1,000 words. At least 700 of these are 
polysyllabic adjectives and nouns ending in unstressed /1v/ spelt <-ive>, 
e.g. adjective, endive, expletive, gerundive, massive, narrative, olive, 
relative. (A century ago Dewey tried, but failed, to have the phonographically 
redundant final <e> removed from the US spelling of these words; it is 
equally redundant in all the non-split digraph categories mentioned below, 
but is probably even more resistant to change there. He might have had 
more success if he had advocated removal of the redundant final <e> in 
words ending in <-ate, -ite> where the <e> is also not part of a split 
digraph - for these words see sections 3.5.7 and 9.34 - or any of the other 
graphemes with redundant final <e>, of which there are many). 

Among the small number of remaining polysyllables are the preposition 
above, the noun octave, and groups of words with: 

1) /I/ spelt <I> preceding <ve>, e.g. dissolve, evolve 

2) vowel digraphs preceding <ve>, e.g. bereave; receive and several other 
words in <-ceive>; deserve; achieve, believe, relieve, reprieve, retrieve 

3) split digraphs where the <e> is part of both the digraph and <ve>, e.g. 
behave, conclave, forgave; alive, archive, arrive, deprive, naive, ogive, 
recitative, revive, survive; alcove, mangrove. 

Among monosyllables with final /v/ spelt <ve> are those with 
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1) 


2) 


3) 


4) 


a short vowel phoneme immediately preceding <ve>, namely have; give, 
live (verb, /l1v/); dove, glove, love, shove; and - with its unusual digraph 
spelling of a short vowel - sieve 

/\/ spelt <I> preceding <ve>, e.g. salve pronounced /selv/), valve; 
delve, shelve, twelve; solve 

vowel digraphs preceding <ve>, e.g. waive; calve, halve, salve 
pronounced /satv/; carve, starve; mauve; greave, heave, leave; sleeve; 
nerve, serve; grieve, thieve; groove; curve 

split digraphs where the <e> is part of both the digraph and <ve>, 
€.g. gave, shave, suave, wave; breve, eve; drive, five, hive, jive, live 
(adjective, /larv/), swive, wive; cove, drove, move, prove; gyve. 


The point of this long analysis has been to show in how few words ending 


in a single vowel letter plus <ve> the <e> forms digraphs both with the 


<v> and with the vowel letter (for dual-functioning see section 7.1). In 


this respect <ve> is very unlike the other principal word-final consonant 


digraphs formed with <e>, namely <ce, ge, se>. 


Medially, <ve> occurs in: 
the few words mentioned above under ‘Doubled spelling’ 
a large number of regular plural nouns and singular verbs, e.g. haves 
(vs have-nots), gives, grieves, initiatives, dissolves, lives (verb), loves, 
improves, preserves, mauves 
a small number of irregular plural nouns ending in /vz/ spelt <-ves> 
where the singular forms have /f/ spelt <f>, namely calves, dwarves, 
elves, halves, hooves, leaves, loaves, scarves, (our/your/them-)selves, 
Sheaves, shelves, thieves, turves, wharves, (were)wolves. On 4/1/15 
the form behalves appeared in the Observer; various websites decry 
this form as obsolete or unnecessary 
a very few similar words where the <f> in the singular is within the 
split digraph <i.e>: knives, lives (/latvz/; the singular verb of the 
same spelling is pronounced /l1vz/), (ale/good/house/mid-)wives 
(but if housewife ‘sewing kit’ pronounced /‘hazif/ has a plural it is 
presumably pronounced /'hazifs/). 


Why the irregular words in the third category in this list with no <e> in the 


singular have an <e> in the plural is unclear (i.e. they could be spelt “elvs, 


“leavs, etc.), unless verbs like calve, halve, leave, salve, shelve, thieve and 


their singular forms calves, halves, leaves, salves, shelves, thieves influence 


the plural nouns, or the strong prohibition on word-final <v> (see above) 


extends to stem-final position in plurals. 
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A couple of the nouns just listed have alternative, regular plurals: dwarfs, 
turfs, and the only plural of lowlife is said to be lowlifes (though | have seen 
the form /owlives in print). 

Conversely, roofs, which (officially) has only that regular plural spelling, 
has both the regular pronunciation /ru:fs/ and the irregular pronunciation 
/rurvz/, but the latter pronunciation is hardly ever recognised in the spelling 
as “rooves (though this form has been printed twice in The Guardian: (1) 18 
July 2009, main section, p.39 (in a puzzle); (2) 26 September 2009, Review 
section, p.7 (in a poem); internet exploration revealed various people 
wondering if “rooves or roofs was the plural spelling because they say 
/ru:vz/). This pronunciation and the spelling roofs provide the only other 
example, besides of, of /v/ spelt <f>. 


3.7.8 /z/ as in Z00 


THE MAIN SYSTEM 


For all these categories see Notes. 


Basic grapheme  <z> 5% regular in word-initial position, 
(with e.g. zoo (only exceptions: sorbet 
<ze>- pronounced /'z>:be1/ (also 


for this, pronounced /'sd:be1/), sauerkraut 

see below if pronounced with German /z/, and 

under the words in <x-> listed below); 

Oddities) medially, only in amazon, azyme, 
bazooka, bedizen, benz-ene/ol, bezel, 
bezique, blazer, bombazine, bonanza, 
brazen, cadenza, chimpanzee, coryza, 
crazy, denizen, enzyme, extravaganza, 
frozen, gazebo, gazump, gizmo, 
(hap)hazard, hetero-/mono-zygous, 
influenza, lazy, lizard, magazine, 
mazurka, muzak, ozone, phizog, plaza, 
protozoa, razor, samizdat, schnauzer, 
spermatozoon, stanza, teazel/teazle 
(also spelt teasel, trapez-ium/oid, 
vizard, vizier, vizor (also spelt with 
<s>), wizard, wizen(ed), zigzag; 
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Other frequent 
graphemes 


<s> 


<se> 


93% 
(with 
<se>) 


word-finally without a following 

<e> only in fez, phiz, quiz, topaz, 
whiz, the abbreviated compound 
word showbiz and a few other very 
rare words; phonemically word-final 
within split digraphs, only in amaze, 
blaze, craze, daze, gaze, glaze, graze, 
haze, laze, maze, raze, trapeze; 
cloze, doze, froze, plus the four nouns 
ending in /atz/ always spelt <-ize> 
(assize, capsize, prize, size) and the 
large number of verbs ending in /atz/ 
spelt <-ize> (almost all of which have 
alternative spellings in <-ise>) 


word-initial only in sorbet if 
pronounced /'zo:bet/and sauerkraut 
if pronounced with German /z/ 
Regular 

(1) in medial position, e.g. chisel, 
preside, seismic, talisman; 

(2) word-finally (see above for 
exceptions with <z>, and below for 
all other exceptions); 

(3) in various suffixes and contracted 
forms after a vowel or non-sibilant 
voiced consonant - see Notes 


never word-initial or -medial (except 
medially in compound words, e.g. 
gooseberry /‘guzbri:/), housewife 
‘sewing kit’, pronounced /‘hazif/; 
also in miserable if pronounced 
/'mizrabal/ - see section 6.10); 
regular word-finally in content words 
after a long vowel or diphthong spelt 
with a digraph, e.g. blouse, bruise, 
cause, chauffeuse, cheese, choose, 
drowse, noise, parse, please, raise 


Doubled spelling 


THE REST 


Doubled spelling + <e> 


Oddities 
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<zz> <1% 


<cCZ> 


<sc> 


<Ss> 


<ts> 


<xX> 


regular at the end of one-syllable 
words after a short vowel spelt with 
one letter - but see Notes; also 
regular before word-final /al/ spelt 
<-le> after a short vowel spelt with 
one letter, e.g. dazzle (see section 
4.3.3); otherwise only in blizzard, 
buzzard, dizzy, fizzog, gizzard, 
grizzly, mizzen, muezzin, muzzy, 
pizzazz, razzmatazz, scuzzy, snazzy, 
tizzy, wazzock. For <zz> arising from 
consonant-doubling before a suffix 
see section 4.2 


(does not occur) 
2% in total 
only in czanina) 


only in crescent pronounced /'krezant/ (also 
pronounced /'kresant/) 


only medially and only in Aussie, brassiere, 
dessert, dissolve, hussar, Missouri, possess, 
Scissors 


only in tsar 


word-initially, only in some words of Greek 
origin, namely xanthine, xanthoma, xanthophyll, 
xenon, xenophobia and several other words 
beginning xeno-, Xerox and several other words 
beginning xero-, xylem, xylene, xylophone and 
several other words beginning xylo-. Medially, 
only in anxiety pronounced /zn'zarjiti:/ (also 
pronounced /zn'gzaijjtti:/). 

French loanwords ending in <-eau> are 
sometimes in the plural written French-style 
with /z/ spelt <x> rather than <s>, e.g. 
beau-s/x, bureau-s/x, flambeau-s/x, gateau-s/x, 
plateau-s/x, portmanteau-s/x, trousseau-s/x; 
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indeed, my dictionary gives only the <x> 
form in bandeaux, chateaux, rondeaux, 
tableaux. In my opinion the <x> form is 
outmoded and unnecessary 


<ze> only word-final and only in adze, bronze 
after consonant phonemes, plus baize, 
booze, breeze, freeze, frieze, furze, gauze, 
maize, ooze, schmooze, seize, sleaze, sneeze, 
snooze, squeeze, wheeze after long vowels or 
diphthongs spelt by digraphs - in all these 
words the <e> is phonographically redundant 
but spellings without it would look odd. In the 
hundreds of verbs ending /atz/ which may 
or must be spelt with <-ize>, plus the few 
other stem words where <z> appears within 
split digraphs (see above), it is unnecessary to 
analyse the <z> as also being part of digraph 
<ze> because the <z> spells /z/ without the 
<e> - but see Notes 


2-phoneme graphemes /gz/ <1% see under /g/, section 3.5.3 
spelt <x, xh> 


/1z/ only, following an apostrophe, in regular 

spelt <s> singular and irregular plural possessive 
forms ending in a sibilant consonant 
(/s, z, J, 3, tf, d3/), e.g. Brooks’s (book), jazz’s 
(appeal), Bush’s (government), (the) mirage’s 
(appearance), (the) Church’s (mission), (the) 
village’s (centre), (the) geese’s (cackling). See 
Notes 


NOTES 


/z/ is the phonological realisation of various grammatical suffixes (regular 
noun plural and third person singular person tense verb (both spelt <s> 
where the stem ends in <e>, otherwise <es>), regular singular and irregular 
plural possessive (spelt <’s>)), and of is, has when contracted (also spelt 
<’s>), after any vowel or voiced non-sibilant consonant (/ob dgIl mnnvd/). 
As just shown, in all these cases the spelling contains <s>. 

However, because /z/itself is a sibilant consonant, adding any of the 
suffixes just listed to a stem ending in /z/ adds a syllable /1z/ as well as 
a morpheme: fuses, quizzes, jazz’s. On this and the topic of the previous 
paragraph see also /s/, section 3.7.6, and /1/, section 5.4.3. 
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<zz> is regular word-finally in one-syllable words after a short vowel 
spelt with one letter, but there are only seven words in this set (buzz, fizz, 
frizz, fuzz, jazz, tizz, whizz, plus two polysyllables: pizzazz, razzmatazz) 
and there are 10 counter-examples (as, fez, has, his, is, phiz, quiz, was, whiz 
and cos, the abbreviation of because; cos, the lettuce and the abbreviation 
of cosine, vary in pronunciation between /kpoz/ and /kos/) - but three 
of the counter-examples (has, is, was) are verbs following the rule that 
grammatical endings in /z/ are spelt <s>. 

<zz> is also regular before word-final /al/ spelt <-le> after a short 
vowel spelt with one letter (see section 4.3.3), and very rare other than in 
this and the context mentioned in the preceding paragraph. 

<z> is regular in word-initial position, and very rare elsewhere, except 
in one class of verb stem endings. In British English, almost all the verbs 
whose stems end in /atz/ can be spelt with either <-ise> or <-ize>, e.g. 
atomise/atomize. There are a few which can only be spelt with <-ize>, 
namely capsize, prize, size, and a larger group which can only be spelt with 
<-ise>, namely advertise, advise, apprise, chastise, circumcise, comprise, 
compromise, despise, devise, enterprise, excise, exercise, franchise, 
improvise, incise, merchandise, premise (/pri'maiz/ ‘base argument upon’), 
prise, realise, revise, rise, supervise, surmise, surprise, televise. 

Exceptions to the <-ise/-ize> choice are a few verbs which are mainly 
spelt with <-yse> in British English (though US spellings in <-yze> are 
becoming commoner): analyse, breathalyse, catalyse, dialyse, electrolyse, 
paralyse. And plural nouns and singular verbs whose stems end in /a1/ and 
which therefore end in /atz/ when suffixed (e.g. dies, dyes, lies, vies) follow 
different rules. 

Of the 15 words ending /e1z/ spelt with a split digraph, 11 are spelt with 
<-aze> (amaze, blaze, craze, daze, gaze, glaze, graze, haze, laze, maze, 
raze), and 4 with <-ase> (erase, phase, phrase, ukase). Vase appears to be 
the only word ending in /a:z/ (in RP). 

The only word in which word-final /i:z/ is spelt <-eze> is trapeze, and 
cerise, chemise, expertise, reprise, valise appear to be the only ones in which 
it is spelt <-ise>. Besides these spellings in which /i:/ is spelt by a split 
digraph (and in theory /z/ is spelt only by the <z>, though it is immaterial 
whether one instead recognises the <e> as part of digraph <ze> and 
therefore as dual-functioning), word-final /i:z/ has four further spellings 
in which /i:/ is spelt by a non-split digraph, and /z/ definitely by either 
<ze> (breeze, freeze, sneeze, squeeze, wheeze; frieze) or <se> (appease, 
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(dis)ease, (dis)please, heartsease, pease, tease; cheese). In all other cases, 
of which there are dozens, especially nationality/language words, plus 
these, word-final /i:z/ is spelt <-ese>. This may seem to be one of the 
few cases where the spelling of a final /VC/ pattern is more predictable as 
a unit than from correspondences for its two phonemes, but actually it is 
predictable from them: the regular spelling of /i:/ in closed final syllables 
of polysyllables is <e.e> (see section 5.7.2), and the regular spelling of 
phonemically word-final /z/ is <s>. See also section A.7 in Appendix A. 

There are only 2 words ending /auz/ spelt <-oze>: doze, froze; the rest 
are spelt with <-ose>, e.g. chose, close (verb), hose, pose, prose, rose, those. 

There are no words ending /(j)u:z/ spelt <-uze>, and only 1 spelt 
<-ose>: Jose; the rest are spelt with <-use>, e.g. abuse (verb), accuse, 
amuse, bemuse, excuse (verb), enthuse, (con/dif/ef/in/suf-) fuse, hypotenuse, 
muse, peruse, refuse (verb), ruse, use (verb). 

In all cases other than those listed, the regular spelling of medial and 
final /z/ is <s>, including the grammatical suffixes mentioned above. 


3.8 Consonants without doubled spellings: 
/hos3e0dwj/ 


3.8.1 /h/ as in who 


Occurs only before a vowel phoneme, therefore never word-finally. 


THE MAIN SYSTEM 


Basic grapheme <h> 97% e.g. behave, have 


Other frequent graphemes (none) 


THE REST 
Oddities <j> only in fajita, jojoba (twice), 
marijuana, mojito, Navajo 


<wh> 3% - only in who, whom, whose, 
whole, whoop(er/ing), whore 


2-phoneme graphemes (none) 
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NOTES 


/h/ is rare medially, but cf. adhere, behave, behind, bohemian, cahoots, 


clerihew, cohere, cohort, enhance, inhabit, inherit, mayhem, perhaps 


pronounced /pa'heps/ rather than /preps/, prehensile, shanghai, and 


compound words such as anyhow, meathook, mishap, mishit, peahen, 


poorhouse, prehistoric, sawhorse, sunhat, warhead, warhorse. 


Because Carney (1994) did not include function words such as who, 


whom, whose in his frequency counts, his percentage for /h/ spelt <wh> is 


distinctly lower than if he had included them. 


3.8.2 /n/ as in ring 


Occurs only post-vocalically, and therefore never word-initially (in English). 


Also, except in very rare cases such as spraing, occurs only after short 


vowel phonemes (and even then never after /u/ (in RP); also very rare after 


/e, a/ - see Notes) 


THE MAIN SYSTEM 


Basic grapheme <ng> 75% 


Other frequent <n> 25% 
grapheme 


THE REST 

Oddities <nc> 
<nd> 
<ngh> 
<ngu> 
<ngue> 


e.g. bang, sing, zing, long, lung 


before /k, g/, however spelt, e.g. sink, 
zinc, anxious, conquer, ankle, uncle, 
length; longer, kangaroo, anxiety. See 
Notes 


only in charabanc /‘Se#raben/ 


only in handcuffs, handkerchief 
/‘henkafs, 'henkeatfrf/ 


only in dinghy, gingham, Singhalese 

/‘dini:, 'ginam, sina'lizz/ (contrast shanghai 
/Seen'hat/) 

only in a very few suffixed forms of words in 
next category, namely haranguing, tonguing. 
See also end of section 6.4 


only in harangue, meringue, tongue /ha'ren, 
ma'ren, tan/ (contrast dengue /'denget/) 


88 Dictionary of the British English Spelling System 


2-phoneme (none) 
graphemes 


NOTES 


The conclusion that /n/ before /k/ is spelt <n> is based on words like ankle 
/‘enkal/, carbuncle, crinkle, peduncle, periwinkle, rankle, sprinkle, tinkle, 
twinkle, uncle, winkle, wrinkle, where /k/ is clearly spelt <k, c>, so that the 
preceding /n/ must be represented by the <n>. Then the same analysis 
must apply to angle /'zngal/, even though this means that here the letters 
<n, g> do not form the grapheme <ng> and do not jointly spell /n/. The 
same applies to finger /'finga/, but in singer /'stna/ there is no /g/ (in 
RP, though there is in Lancashire), so that here the letters <ng> do form a 
single grapheme representing /n/. 

The words Jength, lengthen, strength, strengthen pronounced 
/lenk®, ‘lenkOan, strenk®, 'strenk@an/ (for their alternative pronunciations see 
under /n/, section 3.5.5) and angst /enkst/ appear to need an analysis in 
which /n/ is spelt <n> and /k/ is spelt <g> - if so, these words and disguise 
/dis'ka1z/, disgust pronounced /dis'kast/, i.e. identically to discussed (disguise, 
disgust are also pronounced /d1z'gaiz, diz'gast/, i.e. with both medial 
consonants voiced rather than voiceless) would be the only occurrences of /k/ 
spelt <g>, though the spelling of /n/ with <n> in angst, length, lengthen, 
strength, strengthen conforms to the analysis of the many words with this 
correspondence just given (see also under /k/ in section 3.7.1). 

The words length, lengthen, strength, strengthen are also among the 
very few in which /n/ occurs after /e/. The only other examples seem 
to be dengue, dreng ‘free tenant in ancient Northumbria’, enchiridion, 
enclave pronounced /'enkleiv/ (also pronounced /'onkletv/), enkephalin, 
ginseng, nomenclature, the abbreviation SENCo (‘Special Educational Needs 
Coordinator’) pronounced /'senkau/, and words beginning encephal- when 
pronounced /en'kefal-/ (also pronounced /en'sefal-/). The only cases 
in which /n/ follows /a/ may be words like concur(rent) if pronounced 
/kan'k3:, kan'karant/. 

Although long, strong, young end in /n/ (in RP) and are therefore to be 
analysed as containing /n/ spelt <ng>, the comparative and superlative 
forms longer, longest, stronger, strongest, younger, youngest and the 
verb elongate all have medial /ng/, so here the /g/ has ‘surfaced’ and is 
represented by the <g> (see section 7.2), and the /n/ is spelt <n>. Similarly 
with diphthongise, prolongation, which gain a /g/ relative to unsuffixed 
diphthong, prolong. By contrast, in longevity /lpn’dgeviti:/ the surfacing 
phoneme is /d3/. 
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The word anxiety has two pronunciations: /aen'gzatiti:, en'zariti:/, 
where the second lacks /g/. Both are rarities: the first is the only instance of 
/qg/ where the /g/ is not spelt <g>; the second is the only case where /n/ 
is spelt <n> without a following /k, g/. 


3.8.3 /J/ as in fission 


THE MAIN SYSTEM 


Basic grapheme <sh> 37% e.g. ship, fish, regular in initial 
and final positions; rare medially, 
but cf. ashet, baksheesh, 
banshee, bishop, buckshee, 
Bolshevik, bolshie, bushel, cashier, 
cashmere, cushion, dasheen, 
dishevel, fashion, geisha, kosher, 
kwashiorkor, marshal, pasha, 
pashmina, ramshackle, sashay, 
worship, yashmak and words with 
the suffix -ship. For exceptions in 
initial and final positions see the 
Oddities, below. Also see Notes 


Other frequent  Allthese graphemes For all these categories see Notes 


graphemes occur only medially and Table 3.6 
<ti> 55% regular medially, e.g. nation, but 
(with <ci, there are many exceptions 
si, ssi>) 
<ci> e.g. commercial, crucial, delicious, 


judicial, logician, magician, 
official, racial, social, special 


<si> e.g. aversion, emulsion, pension, 
repulsion, reversion, tension, 
torsion, version 


<ssi> e.g. accession, admission, 
discussion, emission, intercession, 
obsession, passion, percussion, 
permission, recession, remission, 
session 
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Rare grapheme 


THE REST 


Oddities 


- in initial position 


- in medial position 


<ce> 


<ch> 


<s> 


<sch> 


<sj> 


<c> 


<ch> 


<che> 


regular medially in /erfas/ spelt <-aceous>, 
e.g. cretaceous, curvaceous, herbaceous, 
sebaceous - see Notes and Table 3.6; otherwise 
only in cetacean, crustacea(n), Echinacea, 
ocean, siliceous 


8% in total 


only in 30+ words of mainly French origin, namely 
chagrin, chaise, chalet, chamfer pronounced /'feemfa/ 
(also pronounced /'t}amfa/), chamois (whether 
pronounced /'femi:/ or /'faamwa:/), champagne, 
chancre, chandelier, chaperone, charabanc, charade, 
chardonnay, charlatan, Charlotte, chassis, chateau, 
chauffeu-r/se, chauvinis-m/t, chef, chemise, chenille, 
cheroot, chevalier, chevron, chi-chi (twice), chic, 
chicane(ry), chiffon, chignon, chivalr-ic/ous/y, chute 


only in sugar, sure and (German pronunciations of) spiel, 
stein, strafe, stumm 


only in schedule (also pronounced with /sk/), 
schemozzle, schist, schistosomiasis, schlemiel, schlep, 
schlock, schmaltz, schmo(e), schmooze, schnapps, 
schnauzer, schnitzel, schnozzle, schuss, schwa 


only in sjambok 


e.g. officiate, speciality, specie(s), superficiality and 
sometimes ap/de-preciate, associate. See Notes 


only in about 20 words of mainly French origin, namely 
attaché, brochure, cachet, cachou, cliché, crochet, 
duchesse, echelon, embouchure, Eustachian, machete, 
machicolation, machine, marchioness, nonchalant, 
parachute, pistachio, recherché (twice), ricochet, ruching, 
sachet; also sometimes in (Greek) chiropody (hence the 
punning shop name Shuropody) 


only in rapprochement 
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<chs> only in fuchsia 


<s> only in asphalt pronounced /'z#Jfelt/ (also pronounced 
/‘esfelt/), censure, commensurate, ensure, insure, 
tonsure 


<sc> only in conscie, conscientious, crescendo, fascis-m/t. See 
Notes 


<sch> only in maraschino, meerschaum, seneschal 
<sci> only in conscience, conscious, fascia, luscious. See Notes 


<se> only in gaseous pronounced /'getsas/ (also pronounced 
/‘gesitjas/). See Notes 


<ss> only in assure, fissure, issue, pressure, tissue 


<t> mainly before <-iate> with the <i> spelling /i:/ (and 
with ‘invisible’ /j/-glide), e.g. differentiate, expatiate, 
ingratiate, initiate, negotiate, propitiate, satiate, 
substantiate, vitiate, plus minutiae, otiose pronounced 
/‘avfitjaus, 'sufsitjauz/ (also pronounced /'auti:jaus, 
‘auti:jauz/), partiality, ratio; also novitiate pronounced 
/na'vifizjat/ (also pronounced /na'vifat/). See Notes 


- in final position 
<ce> only in liquorice pronounced /'Itkar1f / 
<ch> only in Welch and, in phonemically word-final position, 
fiche, gouache, moustache, niche pronounced /ni:J/, 


pastiche, quiche, ruche, where the <e> is part of the 
split digraphs <a.e, i.e, u.e> spelling /a:, iz, ur/ 


<che> only in about 12 words of mainly French origin, namely 
avalanche, barouche, brioche, cache, cartouche, cloche, 
creche, douche, farouche, gauche, louche, panache 


2-phoneme /kJ/ 
graphemes 
(1) only in flexure, luxury, sexual /'flekfa, 'Iakfari:, 
spelt 'sekf(urw)al / 
<x> 
(2) e.g. anxious: see under /k/, section 3.7.1 
spelt 


<xi> 
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NOTES 


Because /J/ is a sibilant consonant, adding any of the suffixes regular noun 
plural and third person singular person tense verb (both spelt <s> where 
the stem ends in <e>, otherwise <es>) and regular singular and irregular 
plural possessive (spelt <’s>) to a stem ending in /{/ adds a syllable /1z/ 
as well as a morpheme: quiches, fishes, Bush’s. See also /z/, section 3.7.8, 
and /1/, section 5.4.3. 

Some rules could probably be given for when <sh> is regular medially, 
but these would be more complicated than giving the list of examples, above. 

As spellings of medial /f/ in stem words, <ti, ce, ci, sci, se, si, ssi> occur 
only at the beginning of the final syllable of a word and immediately after the 
stressed syllable, and the final syllable is always one of /al, an, as/ or (very 
rarely) /am/ (consortium pronounced /kan'sd:fam/ (usually pronounced 
/kan'sa:titjam/), nasturtium), /ans/ (conscience) or just /a/ (e.g. consortia 
pronounced /kan'so:fa/ (usually pronounced /kan'so:titja/), fascia, militia); 
and <si> is always preceded by a consonant letter (except in Asian). 
Exceptions with these features but other spellings of medial /{/: bushel, 
marshal, seneschal, cushion, Eustachian, fashion, fissure, fuchsia, geisha, 
pressure. 

The default spelling of medial /J/ is <ti>; for example, it is regular in 
words ending in /erfan, erfal, ixfan, aufan, (jjurfan/, e.g. nation, spatial, 
accretion, lotion, evolution, pollution (exceptions: Asian, racial, cetacean, 
crustacean, Grecian, ocean, Confucian, Rosicrucian). However, because 
medial /f/ has so many other spellings, the major patterns are set out 
in Table 3.6. What does not come over particularly clearly even then is 
that the only case where there is substantial three-way confusion is over 
final /1fan/: the majority spelling is <-ition>, e.g. volition, but there is 
competition (!) from <-ician, -ission> - see the top right-hand and bottom 
left-hand boxes of the Table (and beware of Titian). 
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TABLE 3.6: THE DISTRIBUTION OF <ti, ce, ci, si, ssi> AS SPELLINGS OF MEDIAL /J/. 


Default spelling: <ti> Exceptions (in addition to those in <sh> listed under the basic 
grapheme and those listed among Oddities above or under 
Subpatterns and Sub-exceptions below): 

<ci>: facial, glacial pronounced /'gletfal/ (also pronounced 
/'gletsitjal/), racial, (e)special, financial, provincial, social, 
commercial, crucial, Grecian; academician, electrician, logician, 
magician, mathematician, mortician, musician, obstetrician, 
optician, patrician, phonetician, physician, politician, statistician, 
tactician (N.B. most words in <-ician> are occupations); 
suspicion, Confucian, Rosicrucian, precious, specious, siliceous; 
auspicious, avaricious, capricious, delicious, judicious, malicious, 
meretricious, officious, suspicious, vicious, atrocious, ferocious, 
precocious and various other rare words 

<si>: controversial, torsion 


<ssi>: fission 


Each of the subpatterns below is an exception to the rule that the default spelling is <ti>, 
and each subpattern has its own sub-exceptions (some of which revert to <ti>) 


Subpattern Sub-exceptions 

For /etfas/ the regular spelling is <-aceous>, audacious, capacious, 

e.g. cretaceous, curvaceous, herbaceous, sebaceous contumacious, efficacious, 

plus about 100 other words, mostly scientific and all fallacious, gracious, loquacious, 
very rare mendacious, perspicacious, 


pertinacious, pugnacious, 
rapacious, sagacious, tenacious, 
vivacious, voracious; 

gaseous, Spatious, Ignatius 


For /1fal/ the regular spelling is <-icial>, e.g. artificial,| initial plus 4 other rare words in 
beneficial, (pre) judicial, official, sacrificial, superficial <-itial> 
(but this is a very small set) 


For /fan/ preceded by /3:/ spelt <er, ur> or by /I, n/ Cistercian, coercion (also 


spelt <I, n>, the regular spelling is <-sion>, e.g. (a/ pronounced with /3/), exertion, 
re-)version, excursion, emulsion, expulsion, pension, Persian (also pronounced with 
tension /3/), tertian; 

gentian 


For /fan/ preceded by /x, e, mt, A/ spelt <a, e, mi, u>, 
the regular spelling is <-ssion>, e.g. (com) passion; national, ration, (inrational: 
(ac/con/inter/pro/re/se/suc)cession, con/pro-fession, 
ag/di/e/in/pro/re/retro/trans-gression, 
com/de/ex/im/op/re/sup-pression, session and all its 
compounds; 
(ad/com/e/inter/intro/manu/per/re/sub/trans) mission; 
con/disf re) per-cussion Prussian, Russian 
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Spellings of medial /J/ with <c, sc, t> always have a following <i(e)>, but 
the <i(e)> is a separate grapheme spelling /i:/. Some of the relevant words 
have alternative pronunciations with /s/, e.g. appreciate /a'pri:fixjert, 
a'pritsisjert/, negotiate /ni'gaufitjert, ni'gausitjert/, species /'spizfirz, 
'spirsitz/. 

And then there seems to be a phonological constraint in many 
RP-speakers’ accents against medial /{/ occurring twice in words ending 
in /ixjett/ which already have one medial /f/ and then would acquire 
another if suffixed to end in /i:'jerfan/. For example appreciation and 
negotiation are mainly pronounced /apritsi:'jerfan, nigausi:'jerfan/, not 
/aprisfis'jetfan, nigaufi:'jerfan/. But this in turn does not apply in words 
which do not end in /ertfan/ spelt <-ation>: conscientious is always 
pronounced with two occurrences of medial /f/: /konfi:'jenfas/, and 
recherché obviously has two: /ra'feafer/. The constraint also clearly does 
not apply to words with the -ship suffix, e.g. relationship /r1'letfanfip/. 


3.8.4 /3/ as in vision 


The least frequent phoneme in spoken English. 


THE MAIN SYSTEM 


For all three categories see also Notes. 


Basic grapheme <si> 91% e.g. freesia, vision. Only medial 
(with <s>) 


Rare graphemes <s> only medial before <u> and only in 
casual, usual, visual: (dis/en/fore-) 
closure, composure, embrasure, 
erasure, exposure, leisure, 
measure, pleasure, treasure, 
treasury, usur-er/y/ious 


<ge> 4% never initial; medially, only in 
bourgeois(ie), mange-tout, regular 
in word-final position, where it 
occurs only in about 25 words of 
mainly French origin, namely beige, 
cortege, concierge, liege, melange, 


THE REST 


Oddities 


2-phoneme grapheme 
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<ci> 


<g> 


<j> 


<se> 


<ti> 


<Z> 


<zi> 


/g3/ spelt <x> 


rouge and, with the <e> also 
forming part of the split digraphs 
<a.e, i.e, u.e> (for dual- 
functioning see section 7.1), in 
badinage, barrage, camouflage, 
collage, corsage, decalage, 
décolletage, dressage, entourage, 
espionage, fuselage, garage 
pronounced /'gxra:3/, massage, 
mirage, montage, triage, sabotage; 
prestige; luge; only exception in 
word-final position is raj /ra:3/ 


5% in total 


only, exceptionally but increasingly, 
in coercion pronounced /kau'w313an/ 
(usually pronounced /kau'w3:fan/) 


initially, only in genre, gilet; medially, 
only in aubergine, conge, dirigiste, 
largesse, negligee, protege, regime, tagine 
and lingerie pronounced /‘lzenzari:/ 

(also pronounced /'lpndgarer/); never 
word-final 


only in jihad, raj and some rare French 
loanwords, e.g. bijou, goujon, jabot, 
jalousie, jupe 


only in nausea, nauseous pronounced 
/'nd13a(s)/ (also pronounced /'ndzi:ja(s)/) 


only in equation /1'kwe1zan/ 


only in azure pronounced /'xzZ~a, 'e139/ 
(also pronounced /'xzj(u)a, ‘e1zj(u)a/), 
seizure /'sit3a/ 


only in brazier, crozier, glazier pronounced 
/‘breiza, 'krauza, 'gle1za/ (also pronounced 
/‘breizitja, 'krauzisja, 'glerzi:ja/) 


see under /g/, section 3.5.3 
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NOTES 


Because /3/ is a sibilant consonant, adding any of the suffixes regular noun 
plural and third person singular person tense verb (both spelt <s> where 
the stem ends in <e>, otherwise <es>) and regular singular and irregular 
plural possessive (spelt <’s>) to a stem ending in /3/ adds a syllable /1z/ as 
well as a morpheme: massages, (the) Raj’s (collapse). See also /z/, section 
3.7.8, and /1/, section 5.4.3. 

As spellings of /3/, <si, s> occur only medially and immediately after the 
stressed syllable, and are preceded by a vowel, and <s> is always followed 
by <u>. Almost all spellings with <si> are followed by <-on>, e.g. vision, 
but there are a few others, namely crosier, hosier(y), osier. 

Although Carney gives 91% for <si, s> combined, it is clear that the 
great majority of these must be <si> spellings, since there are rather few 
words with /3/ spelt <s> and a large number with /3/ spelt <si>. This is 
why | have classified /3/ spelt <s> as a rare grapheme. 

Treating <ge> as the regular spelling of word-final /3/ is justified by 
the first six words of French origin listed above: here the preceding vowel 
phonemes (plus the /n/ in melange) are represented without the aid of the 
word-final <e>. In the other 19 words <ge> is clearly still spelling /3/, but 
it is necessary (and parallels other parts of the analysis) to analyse the <e> 
as also forming part of the split digraphs <a.e, i.e, u.e> spelling /a:, ix, ux/ 
(even though the last two correspondences have only one instance with 
included <g> each) - for dual-functioning see section 7.1. Then | analyse 
the /e/ in cortege as spelt only by the first <e> because it is a short vowel 
and no short vowels (in my analysis) are spelt by split digraphs - see 
section A.6 in Appendix A. Then <g> has to be recognised as a grapheme 
spelling /3/ separate from <ge> because of the few words listed where this 
correspondence occurs initially and medially and the following vowel letters 
are obviously (involved in) separate graphemes. 

The spelling <zh> is also used to represent /3/, but because this occurs 
only in transcriptions of Russian names, e.g. Zhivago, Zhores, | have not 
added it to the inventory of graphemes. 


3.8.5 /8/ as in thigh 


THE MAIN SYSTEM 


Basic grapheme <th> 100% 


Other frequent graphemes_ (none) 
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THE REST 
Oddities <phth> only in apophthegm /'zpa8em/, phthalate 
/‘Ozlert/ 
<the> only in Catherine with first <e> elided (see 
section 6.10), saithe (/se10/, ‘fish of cod 
family’) 
2-phoneme grapheme /t6/ see under /t/, section 3.5.7 
spelt <th> 
NOTES 


In the rare word saithe, the only function of the <e> seems to be to keep 
this word visually distinct from saith (/se8/, archaic form of says) with the 
rare spelling <ai> for /e/. 

See also the Notes under /6/, next. 


3.8.6 /6/ as in thy 


THE MAIN SYSTEM 


Basic grapheme <th> 100% 


Other frequent graphemes_ (none) 


THE REST 

Oddity <the> <1% — only word-final and only in breathe, 
loathe, seethe, sheathe, soothe, 
Staithe, teethe, wreathe. See Notes 

2-phoneme graphemes (none) 

NOTES 


In all the words listed under Oddity, the vowel digraphs preceding <-the> 
spell a long vowel or diphthong, and there is therefore no need to analyse 
the final <e> as part of complex split graphemes <ea.e>, etc. However, 
in bathe, lathe, unscathed (the free form scathe meaning ‘to harm’ does 
not occur, but underlies both unscathed and scathing), swathe, blithe, 
lithe, tithe, writhe, clothe, hythe, scythe, the final <e> is part of the split 
digraphs <a.e, i.e, 0.e, y.e> spelling /er, al, au, at/, so here /0/ is again 
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spelt <th>. The final <e> keeps breathe, loathe, seethe, sheathe, soothe, 
teethe, wreathe, bathe, lathe, swathe, clothe visually distinct from breath, 
loath (also spelt /oth), seeth (/'si:j18/, archaic third person singular present 
tense form of see), sheath, sooth, teeth, wreath, bath, lath, swath, cloth. For 
the few minimal pairs differing only in having /8/ or /O/ see section 9.36. 

The fact that both /6/ and /6/ are spelt <th> is useful in writing: people 
whose accents have different distributions of the two phonemes nevertheless 
spell the relevant words identically. This is particularly the case with some 
plural nouns, e.g. baths pronounced /ba:6z/ in RP but /b#@s/ by many 
people from the North of England; the singular has /@/ in both cases. But 
this does not help people trying to read unfamiliar words containing <th> 
- though again see section 9.36. 


3.8.7 /w/ as in well 


Occurs only before a vowel phoneme, and therefore never word-finally 


THE MAIN SYSTEM 


For all these categories, and /w/ not represented at all, see Notes. 


Basic grapheme <w> 64% e.g. word. regular initially, rare 
medially 

Other frequent <u> 31%, e.g. quick, language. Never initial, 
graphemes of which 27 regular medially 

percentage 

points are 

occurrences 

of /kw/ spelt 

<qu> 

<wh> 5% medially, only in erstwhile, 


meanwhile, narwhal, over-/under- 
whelm; otherwise only initial and 
only in whack, whale, wham(my), 
whang, wharf, what, wheat, 
wheedle, wheel, wheeze, whelk, 
whelp, when, whence, where, 
wherry, whet, 


THE REST 


Oddities 


2-phoneme 
graphemes 
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<hu> 
<ou> 


<ww> 


/wa/ spelt <o> 


/wa:/ 


(1) spelt <oi> 


(2) spelt <oir> 


whether, whey, whiff, whiffle, 
Whig, which, while, whim, 
whimper, whimsical, whimsy, 
whin, whine, whinge, whinny, 
whip, whippersnapper, whippet, 
whippoorwill, whirl, whir(y, whisk, 
whisker, whisk(e)y, whisper, 
whist, whistle, whit, white, 
whither, whitlow, Whitsun, whittle, 
Whitworth, whiz(z), whoa, whomp, 
whoopee, whoops, whoosh, whop, 
whump, whup, why, whydah and 
a very few other rare words (e.g. 
whilom) 


<1% in total 
only in chihuahua (twice) 
only in Ouija 


only in bowwow, glowworm, powwow, 
skew(-)whiff, slowworm 


only in once, one - unless you prefer 
to consider the /w/ as not being 
represented in the spelling at all - see 
Notes and section 9.0 


See also Notes 


only in a few words more recently 
borrowed from French, e.g. bourgeoisie, 
coiffeu-r/se, coiffure, pointe, soiree, 
toilette 


mainly word-final and only in a very few 
words more recently borrowed from 
French, namely abattoir, avoirdupois, 
boudoir, memoir, noir, reservoir, soiree, 
voussoir. /r/-linking occurs in memoirist, 
noirish - see section 3.6 
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(3) spelt <oire> only word-final and only in a very 
few words more recently borrowed 
from French, namely aide-memoire, 
conservatoire, escritoire, grimoire, 
repertoire 


(4) spelt <ois> only word-final and only in a very few 
words more recently borrowed from 
French, namely avoirdupois, bourgeois, 
chamois (the animal, pronounced 
/‘Jzmwa:/, as opposed to the leather 
made from its skin, pronounced 
/‘Jzmi:/, the latter also being spelt 
shammy), patois (contrast fatwa). /z/ 
surfaces in bourgeoisie - see section 7.2 

/wat/ spelt <oy> only in foyer pronounced /'fwatjet/, 
voyeur. Here the <y> is both part of the 
digraph <oy> spelling /wa1/ and alsoa 
single-letter grapheme spelling /j/. For 
dual-functioning see section 7.1 


3-phoneme /'wata/ spelt with a single grapheme <oir> only 
grapheme in choir - one of only two 3-phoneme 
graphemes in the entire language 


NOTES 


If we follow Crystal (2012: 131-2), ‘more recently’ in terms of loanwords 
from French means after the Great Vowel Shift, which ended about AD 1600. 

Although phoneme /w/ never occurs at the end of an English stem word, 
letter <w> is very frequent in word-final position, where it always follows a 
vowel letter with which it forms a digraph spelling a vocalic sound. See also 
‘linking /w/’ later in these Notes. 

The 2-phoneme sequences /kw, gw/ are almost always spelt <qu, gu> 
respectively. 

<u> spelling /w/ occurs not only in the familiar /kw/ spelt <qu>, e.g. 
quick, squash, but also in a few words after /k, g, s, z/ spelt <c, g,$, SS, Z>, e.g. 
cuirass, cuisine, cuissse; anguish, distinguish, extinguish, guacamole, guano, 
guava, iguana pronounced /1'gwarna/, language, languish, linguist, penguin, 
Sanguine, Segue, unguent, persuade, pueblo, puissan-ce/t, pursuivant, suave, 
suede, suite; assuage, dissuade; Venezuela (usually pronounced with /z/) and 
some very rare words; otherwise perhaps only in ennui, etui /on'wi:, e'twir/. 
In these contexts <u> is clearly a consonant letter, though it is very rarely 
taught as having (like <y>) both vowel and consonant functions. 
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In linguistic terms it is unnecessary to analyse /kw/ as a single phonetic 
unit since in all words containing /kw/ (those where it is spelt <qu>, plus 
the Oddities acquaint, acquiesce, acquire, acquisitive, acquit (in these five 
words /k/ is spelt <cq>), awkward, coiffeur, coiffeuse, coiffure, cuisine, 
kwashiorkor and even choir) the /k/ is spelt separately, as it is also in 
compounds like backward. However, for teaching purposes many authors 
treat /kw/ and <qu> as units in close correspondence. Even though this 
ignores not only the (admittedly few) words just listed where /kw/ is not 
spelt <qu> (including acquaint, acquiesce, acquire, acquisitive, acquit) 
but also the 60+ words where <cqu, qu, que> are not pronounced /kw/ 
(see under /k/, section 3.7.1), this pragmatic approach to /kw/ spelt <qu> 
seems justified by the high frequency of this correspondence. 

The frequency of <wh> is actually higher than 5% because Carney (1994: 
253) did not count the function words what, when, whence, where, whether, 
which, while, whither, why. Scots and others who have the phoneme /m/ 
(think ‘hw’) in their accent presumably have little difficulty in knowing which 
words to spell with <wh>, but the rest of us just have to learn them. 

/wa/ also has 2-grapheme spellings, e.g. <wo> in wonder. /wa:/ has 
the 2-grapheme spellings <oi> in coiffeur, coiffeuse, coiffure, <ua> in 
guacamole, guano, guava, iguana, suave, and <wa> in kwashiorkor. /wata/ 
has the 2-grapheme spellings <wire> in wire and <uire> in acquire, quire, 
require. 

Medial /w/ spelt <w> is quite rare in stem words. The only words in which 
a medial <w> has only the function of spelling /w/ seem to be awkward 
(second <w>), fatwa, kiwi, kwashiorkor. There are rather more examples in 
which the <w> is both a grapheme in its own right spelling /w/ and part of 
one of the digraphs <ew> spelling /(j)u:/, e.g. in ewer, jewel, newel, sewer 
(/'su:wea/, ‘foul drain’), skewer, steward, or <ow> Spelling /au/ - in stem 
words this occurs only in bowie, rowan (in its English pronunciation /'rauwan/) 
- or /au/ in bowel, dowel, rowel, towel, trowel, vowel, bower, cower, dower, 
flower, glower, power, shower, tower, coward, dowager, howitzer, prowess, 
plus the Scottish pronunciation of rowan /'rauwan/. For words containing 
/aU, au/ see also sections 5.7.4, 5.6.2, and for dual-functioning section 7.1. 

There are also various stem words where medial /w/ occurs in the 
pronunciation as a glide between /(j)ur, au, au/ and a following vowel 
phoneme but has no representation in the spelling, for example: 

after /(j)u:/: altruism, bruin, casual, cruel, cruet, doing, dual, duel, 
duet, duo, fluid, genuine, grueK ling), iguana pronounced /Igju:'watna/, 
pirouette, ruin, silhouette, suet, toing, truant, (in)tuition, usual, 
zoology, zoomorphic 
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after /au/: coerce, coincide, coition, coitus, co-op, Cooperate, co-opt, 


coordinate, co-own, egoism, froing, going, heroic, heroin(e), jingoism, 


Noel, no-one, oboist, phloem, poem, soloist, spermatozoon, stoic(al) 


after /au/: devour, flour, hour, lour, our, scour, sour and dour 


pronounced /'dauwa/, all of which end in /'auwa/, plus the stray 


medial example of sauerkraut. 
This pattern of representing or not representing medial /w/ after /(j)ur, au, au/ 


before a following vowel phoneme is paralleled when stem words ending in 


those phonemes have a suffix beginning with a vowel phoneme added. In 


all such cases there is a ‘linking /w/’ glide between the stem and the suffix 
(see Cruttenden, 2014: 152), but this is represented in the spelling only 
if the stem word already ends in <w>, otherwise not. If there is a <w> it 


continues to function as part of the spelling of the stem-final vowel while 


also now spelling /w/ - the familiar dual-functioning (see section 7.1). If 


the stem does not end in <w> the spelling simply ignores the /w/-glide. 


For examples of both categories see Table 3.7. 


TABLE 3.7: EXAMPLES OF /w/ REPRESENTED OR NOT BETWEEN A STEM WORD 
ENDING IN /(@j)ur, 90, av/ AND A SUFFIX BEGINNING WITH A VOWEL PHONEME 


sewing; blowing, bowing (‘playing 
stringed instrument’), follow-er/ing, 
owing, shadowing, showing, 
sow-er/ing, willowy. 

N.B. Where the <ow> is not stressed, 
not only does linking /w/ occur but 
the vowel is often reduced to /a/ and 
is therefore spelt simply by the <o>, 
e.g. widower. 


Preceding /w/ represented in spelling by /w/ not represented in spelling 

phoneme(s) | <w> 

/(Gj)ur/ brew-er/ing, chewing, few-er/est, do-er/ing, lassoing, toing (and froing); 
hewer, Jewish, leeward pronounced | canoeing, shoeing; mooing, shampooing, 
/‘lu:wad/ (also pronounced shooing, tattoo-ing/ist; rendezvousing; 
/‘li:wed/), mewing, new-er/est, accru-al/ing, argu-able/ing, continu-al/ 
renewer, sinew-ous/y, stewing, ance/ing/ous, gluing, issu-able/ing, 
(inter/re-) view-er/ing pursu-ance/ing, rescuable, subduable, 

Statuesque, su-able/ing, virtuous 

/au/ allow-ance/ing, avowal, bowing Maoist; plough-er/ing 
(‘inclining the head’), miaowing 

/au/ sewer (/'sauwa/, ‘one who sews’), plateauing; hoeing, toeing; (toing and) 


froing, going 
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And this pattern also occurs when stem words ending in /(j)u:, au, au/ are 
followed by a word beginning with a vowel phoneme. In all such cases there 
is a ‘linking /w/’ glide between the two words, but again this is represented 
in the spelling only if the first word already ends in <w>, otherwise not. 
Since this book is mainly concerned with stem words and their derivatives, 
only a few examples of inter-word /w/-glides not represented in the 
spelling will be given: to estimate /tu'westimeit/, go away /'gauwea'wel/, 
slough of despond /'slaowevdis'pond/. 

See also the parallel phenomenon of ‘linking /j/’ (next section), and the 
subtly different one of /r/-linking (section 3.6). 


3.8.8 /j/ as in yell, union 


Occurs only before a vowel phoneme, and therefore never word-finally. 
For all these categories, the percentages, and /j/ not represented at all, 
see Notes. 


THE MAIN SYSTEM 
Basic word-initial <y> 19% e.g. yellow. Regular (very few 
grapheme exceptions) initially except 
before /uz, Ua/; rare medially 
Other frequent <i> c.5% e.g. onion. Only medial 
grapheme 
Frequent 2-phoneme _ /ju:/ See also the main entry for 
graphemes /jur/, section 5.7.5 
(1) 62% e.g. union, illusion, cute. 
spelt Regular as spellings of /jur/ 
<u, Uu.e> in non-final and closed final 
syllables respectively; <u> 
word-final only in coypu, 
menu, ormolu 
(2) 11% e.g. few. Regular as spelling 
spelt of /ju:/ word-finally in 
<ew> monosyllables 
Rare 2-phoneme /jux/ for regular as spelling of /ju:/ 
grapheme spelt percentage word-finally in polysyllables, 


<ue>  seebelow_ e.g. pursue 
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THE REST 


Oddities 


All medial 


<h> 


<j> 


<ll> 


Other 2-phoneme graphemes 


NOTES 


/jux/ 

spelt <eau, eu, ewe, ui, 
ut, uu> (with <ue>, 3% 
of spellings of /ju:/) 


/jue/ 
spelt <eur, ur, ure> 


/je/ 


spelt <eu, u, ua, ure> 


/\j/ spelt <II> 


/nj/ spelt <gn> 


<1% in total 


only in a very few words between 2 
vowels, namely annihilate, vehement, 
vehicle, vehicular 


only in hallelujah and majolica 
pronounced /mar'jolika/ (also 
pronounced /ma'dgplika/) 


only in French-like pronunciations 
of bouillabaisse /buxja:'bes/ and 
marseillaise /ma:set'jez/ and Latin 
American Spanish-like pronunciation 
of tortilla /to:'tizja:/ 


3% in total 


see under /ju:/, section 5.7.5 


see Notes below and under /ua/, 
section 5.6.5 


see under /a/, section 5.4.7 


only in carillon. Most occurrences 

of /\j/ (all medial) have one of the 
2-grapheme spellings <li, lli> - see 
the groups of words in the Notes 
beginning battalion, civilian. The 
sequence /I|j/ also occurs in halyard, 
failure but in both the /I/ is spelt 
<I>; in halyard the /j/ is spelt 
separately as <y>, but in failure is 
subsumed with the final /a/ in <ure> 


only in chignon, cognac, gnocchi, 
lasagne, lorgnette, mignonette, 
monsignor, poignant, seigneur, 
soigné, vignette 


Although /j/ never occurs at the end of an English stem word when 


pronounced alone or before a word beginning with a consonant phoneme, 
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letter <y> is very frequent in word-final position, where it always spells 
a vocalic sound, either singly or as part of a digraph. See also ‘linking /j/’ 
later in these Notes, and section 11.5. 
The correspondences for initial /j/ are fairly straightforward: 
if the word is a monosyllable /j/ is spelt <y>, e.g. yacht, yolk. Only 
exceptions: ewe, Ewell (contrast yew, you, Yule), uke, Ure, use (both 
the noun /jurs/ and the verb /ju:z/ and their derived forms) 
if the word is a polysyllable and the next phoneme is /u:/, the /j/ 
is usually subsumed into <u> spelling the 2-phoneme sequence 
/ju:/, e.g. union, university (exceptions: eucalyptus, eucharist, euchre, 
eugenic, eulogy, eunuch, euphemism, euphorbia, euphoria, Eustachian, 
euthanasia, ewer with /j/ subsumed instead into <eu, ew>; Yucatan, 
Yugoslav, Yupik with /j/ explicitly represented as <y>) 
if the word is a polysyllable and the next phoneme is /ua/, the /j/ is 
usually subsumed into <ur> spelling the 2-phoneme sequence /jua/ 
- but this category includes mainly urea and various words derived 
from it, e.g. urethra, urine, urology (exceptions: eureka, eurhythmic) 
otherwise, that is in polysyllables where the next phoneme is not 
/ux, Ua/, initial /j/ is almost always spelt <y>, e.g. yellow, etc. 
In medial positions, large numbers of instances of /j/ are subsumed into 
2-phoneme spellings of /ja, jua, jur/, and lists of these can be found in 
sections 5.4.7, 5.6.5 and 5.7.5 respectively. There are also a very few 
instances of other 2-phoneme spellings - see above. Otherwise, where a 
consonant phoneme precedes /j/ the predominant spelling appears to be 
<i>, e.g. in: 
a group of words ending /jan/ spelt <-ion> after a stressed syllable: 
battalion, billion, bunion, champion, companion, dominion, million, 
minion, onion, opinion, pavilion, pinion, union, vermilion 
a small group of words ending /jari:/ spelt <-iary> after a stressed 
syllable: apiary, auxiliary, aviary, breviary, domiciliary, pecuniary, 
topiary (contrast January with final /jari:/ spelt <-uary> and requiring 
/je/ to be analysed as spelt <ua>) (but for incendiary, intermediary, 
stipendiary, subsidiary pronounced with /dgari:/, i.e. with /dj/ 
affricated to /dg/, see section 3.7.4) 
a further ragbag: (with /lj/ spelt <(Dli>) civilian (3rd <i>), colliery, 
milieu, plus, in rapid speech, brilliant (2nd <i>); (others) behaviour, 
envious, piano, pronunciation (1st <i>), spaniel, (inter-/re-)view, etc. 
In these cases after a consonant <i> is clearly a consonant letter, though 
even more rarely than <u> is it taught as having (like <y>) both vowel 
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and consonant functions. There are a few exceptions with <y>: banyan 
/‘benjen/, biryani, canyon /'‘kenjan/, halyard, lanyard, vineyard. 
Otherwise, i.e. in situations not yet covered and where /j/ is both 
preceded and followed by vowel phonemes, the predominant spelling of 
medial /j/ is probably zero, i.e. it is not represented at all. There are a few 
exceptions among the Oddities above, plus just three with <i> (alleluia, 
onomatopoeia, pharmacopoeia, where the <i> can be taken to represent 
/j/ since the preceding <u, oe> spell /u:, i:/), and rather more with <y>: 
beyond, bowyer /'bauja/, lawyer, sawyer, yoyo, where <y> spells only 
/j/, plus bayonet, cayenne, crayon, mayonnaise, rayon, which can be 
economically analysed as having /e1/ in their non-final syllables spelt 
(regularly - see sections 5.7.1, 6.3) just by the <a>, so that /j/ is spelt 
just by the <y> 
abeyance, where /j/ is spelt <y> but the <y> is also part of <ey> 
spelling /e1/ 
arroyo, doyenne pronounced /do1'jen/, foyer pronounced /'foija/, 
loyal /'lotjal/, Oyer (and Terminen), royal /'ro1jal/, soya, where /j/ is 
spelt <y> but the <y> is also part of <oy> spelling /31/ 
coyote /kat'jauti:/, doyen and doyenne both pronounced /dwar'jen/, 
foyer pronounced /'fwaijet/, kayak /'katjek/, papaya, voyeur 
/vwatl'j3:/, where /j/ is spelt <y> but the <y> is also part of <ay, oy> 
spelling /at/. 

(For dual-functioning see section 7.1). 

A list of words in which /j/ is not represented at all in medial positions 
could go on for pages. A few examples (illustrating the range of spellings 
within which /j/ is invisible) are: (with final /a/) bacteria, cochlear, idea, 
linear, meteor, senior (all these words and many others would be analysed 
by Carney as ending in /1a/, whereas | assign them to /i:ja/); (others) 
aphrodisiac, appreciate, archaic, audience, axiom, caviar, chaos, chariot, 
create, creole, dais, deify, diary, dossier, foliage, genius, hilarious, jovial, 
lenient, mosaic, museum, odious, pantheon, period, radii, radio, reinforce, 
ruffian, serviette, simultaneous, society, soviet, spontaneity, stallion, tedium, 
triangle, triage, video. 

This pattern of representing or not representing medial /j/ between two 
vowel phonemes is paralleled when stem words ending in /ai, et, 31, ir/ 
have a suffix beginning with a vowel phoneme added. In all such cases 
there is a linking /j/-glide between the stem and the suffix, but this is 
represented in the spelling only if the stem word ends in a digraph ending in 
<y>, otherwise not. If there is such a digraph the <y> continues to function 
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as part of the spelling of the stem-final vowel while now also spelling /j/ 
- the familiar dual-functioning (see section 7.1). If the stem does not end 
in a digraph ending in <y> the spelling simply ignores the /j/-glide. For 


examples of both categories see Table 3.8. 


TABLE 3.8: EXAMPLES OF /j/ REPRESENTED OR NOT BETWEEN A STEM WORD ENDING 
IN /ar, e1, 91, i:/ AND A SUFFIX BEGINNING WITH A VOWEL PHONEME. 


Preceding /j/ represented in spelling by a | /j/ not represented in spelling 
phoneme digraph ending in <y> 
/at/ shanghaiing; higher, highest, sighing; 
beautifying, defying, dryer, dyer, flyer, 
fryer, supplying, plus: 
- some words obeying the 
<y>-replacement rule (section 6.5), 
e.g. alliance, amplifier, defiance, drier, 
flier, 
- the four words which obey the 
<ie>-replacement rule (section 6.6): 
dying, lying, tying, vying; 
- two Oddities: dyeing, eyeing. 
See also paragraph below Table 
/et/ betrayal, conveyance, layer, crocheting, inveighing, laity, neighing, 
layette, obeying, playing, prayer | ricocheting, segueing, weighing 
pronounced /'pretja/ (‘one who 
prays’), preying, purveying, 
Surveying 
/o1/ annoyance, boyish, buoyant, 
cloying, destroyer, enjoying, 
joyous, toying 
/ix/ jockeying, moneyer, volleying absenteeism, agreeing, beauteous, 


fleeing, fre-er/est, liar, nauseous 
pronounced /'n>:zi:jas/, orgiastic, 
precising, seeing, leylandii, fasciitis, 
Skiing, taxiing, plus many words 
obeying the <y>-replacement rule 
(section 6.5), e.g. acrimonious, 
bolshi-er/est, calumniate, carrier, 
centurion, comedian, dalliance, 
dutiable, enviable, historian, industrial, 
luxuriance, melodious, memorial, 
remedial, studious, twentieth, thirtieth, 
etc., variable, variance 
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The words with stems ending in <y> which I’ve placed under ‘/j/ not 
represented in spelling’ opposite /azt/ in Table 3.8 may seem not to 
belong there but in the other column. The <y> here could be analysed as 
representing both the /az/ and the /j/, but this would be the only example 
in my entire analysis of a letter having two single-letter functions - in all 
other cases the dual-functioning letter’s first function is as part of a di- or 
trigraph (see section 7.1). Also, none of the other words in this box would 
Support such an analysis, and as it happens none of the digraphs ending in 
<y> which spell /at/ occur word-finally (see section 5.7.3); hence there are 
no words to put in the left-hand box opposite /at/. 

Some further instances of ‘invisible /j/’ occur within suffixes, e.g. <-ial, 
-ian> /ixjal, ixjan/ (though again Carney would assign these to /19/). 

And this pattern also occurs when stem words ending in /al, e1, 51, ix/ 
are followed in running speech by a word beginning with a vowel phoneme. 
In all such cases there is a ‘linking /j/’ glide between the two words (see 
Cruttenden, 2014: 152), but again this is represented in the spelling only if 
the first word already ends in a digraph ending in <y>, otherwise not. Since 
this book is mainly concerned with stem words and their derivatives, only a 
few examples of inter-word /j/-glides not represented in the spelling will 
be given: | understand /atjanda'stend/, inveigh against /1n'verja'getnst/, 
‘hoi polloi’ is Greek /‘ha1pa'lotjiz'gri:k/, free offer /fri:'jofa/. 

See also the parallel phenomenon of ‘linking /w/’ (previous section), and 
the subtly different one of /r/-linking (section 3.6). 

The only percentage stated by Carney (p.256) is 19% for /j/ spelt <y>. In 
order to work out other percentages | have ignored non-represented /j/. For 
/j/ spelt <i> | deduced a percentage as follows: Mines et al. (1978, Table 
A-1, p.236) show that the ratio of initial to medial /j/ is about 4:1. Since 
/j/ spelt <y> is very rare medially, and the Oddities and the 2-phoneme 
graphemes spelling sequences other than /ju:/ are negligible, it is safe 
to take the ‘junior partner’ in this ratio as medial /j/ spelt <i>. Hence the 
figure of c.5% as the percentage for this spelling of /j/. 

It follows that 2-phoneme spellings of /ju:/ constitute most of the 
remaining 76%. Carney (p.201) states that <u, u.e> are 82% of spellings of 
/jux/, hence {82% x 76%} = 62% of the spellings of /j/, and that <ew> is 15% 
of spellings of /jux/, hence {15% x 76%} = 11% of the spellings of /j/. The 
remaining {100%-(19%+5%+62%+15%)} = 3% of spellings of /j/ are mainly 
the minority 2-phoneme spellings of /jur/, plus spellings of /ja, jua/. 


4. How do you know when 
to write a consonant 
letter double? 


For some people one of the main bugbears of English spelling is remembering 
to write, for example, 

accommodation not “accomodation 

occurred not ‘occured. 

In a national survey of adults’ spelling in England and Wales in 1995 
using just 15 words, accommodation produced the most errors; 68% of the 
people in the survey got it wrong (see Basic Skills Agency, 1996). 

Doubled consonant letters are a bugbear even though the doubled 
consonant spelling with the highest frequency for its phoneme is <Il> at 
only 18% of occurrences of /I/, and most other doubled consonant spellings 
are much less frequent. So in this chapter | provide some guidance on this 

- but be warned: the guidance does not and can not cover every word, so | 
end up saying ‘The rest you just have to remember, or check in a dictionary’. 


4.1 The easy bits 


4.1.1 Consonant letters are never doubled at the 
beginning of a word 


Well, hardly ever. There is [lama (the animal, as opposed to lama, a Tibetan 
monk), and /lano meaning a South American, treeless, grassy plain or 
steppe; also Welsh names like Ffestiniog and Lloyd - but I’m dealing with 
English, and not with names. 
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4.1.2 Some consonant letters are never or almost 
never written double: <h, j, q, v, w, x, y> 


The rule that these seven letters are rarely doubled applies to the whole of the 
rest of this chapter, but | will mention it again where necessary. Almost all the 
exceptions occur in compound words, for example bathhouse, beachhead, 
fishhook, hitchhiker, witchhunt and withhold, where the first <h> is always 
part of a digraph or trigraph ending the first element of the compound 
word; also bowwow, glowworm, powwow, skew(-)whiff (usually spelt with 
the hyphen, however) and slowworm. There are also a few slang words with 
<w>: bevy, bower, chivvy (also spelt chivy), civvy, divvy, flivver, luvv-y/ie, 
navvy, revving, savvy, skivvy, spivveny. Some brandnames deliberately flout 
this rule, e.g. Exxon. 


4.1.3 Doubled consonant letters are very rare after 
long vowels and diphthongs 


— in stem words, that is, though they do of course arise from compounding 
and suffixation, e.g. glowworm, keenness, preferring, really, referral, 
slowworm, warring. Perhaps the only classes of exception are monosyllables 
ending in /a:f, a:s, d:1/ (the first two of these apply in RP but in few other 
accents), which are mainly spelt <-aff, -ass, -all>, e.g. chaff, class, ball 
(see /f, s, |/, sections 3.7.3, 3.7.6, 3.7.5). There are also stray individual 
exceptions, e.g. bouffant, chauffeu-r/se, coiffeu-r/se, coiffure, feoffee, 
feoffment, pouffe, souffle; droll, plimsoll, poll, roll, scroll, stroll, toll; braille, 
camellia, chenille, Ewell, marseillaise, raillery, surveillance, thralldom (also 
spelt thraldom), tulle; arrhythmia if pronounced with initial /er/, potpourri; 
caisson if pronounced /'ketsan/ (also pronounced /ka'surn/), croissant, 
mousse, pelisse, renaissance if pronounced /ra'neisons/, trousseau, 
voussoir, aitch, retch if pronounced /ri:t{/; pizza. Words with final <rr, rre, 
rred, rrh> (charr, parr, err, chirr, shirr, skirr, whirr, burr, purr, barre, 
bizarre, parterre; abhorred, preferred, referred; catarrh, myrrh) may look 
like exceptions too, but here <rr>, etc., are part of the spellings of the long 
vowel or diphthong (see /r/, section 3.5.8). 

This rule also implies its converse, namely that single-letter spellings of 
consonant phonemes are regular after long vowels and diphthongs. 

However, unfortunately the counterparts to these rules are very unreliable: 
after short vowels both single and double consonant letters occur with great 
frequency, and much of the rest of this chapter is an attempt to grapple with that. 
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For all four circumstances see Table 4.1, and for the grapheme-phoneme 
direction see Table 10.4 in section 10.42. Also, for an extended historical 
explanation of why both single and doubled consonant spellings occur after 
short vowels, see Crystal (2012), chapters 7 and 8. 


TABLE 4.1: SINGLE AND DOUBLE CONSONANT SPELLINGS AFTER 
SHORT AND LONG VOWELS 


After long vowel/diphthong | After short vowel 

Single 

Both occur, and doubled 
consonant Regular 
letter spellings are sometimes 

bea predictable but mostly 

POU not - see the rest of this 
consonant Very rare 

chapter 
spelling 


4.2 The main consonant-doubling rule (Part 1 of 
‘double, drop or swop’ — see sections 6.4-5) 


[Acknowledgments: | owe the terms ‘consonant-doubling’, ‘<e>-deletion’ 
and ‘<y>-replacement’ largely to John Mountford, and the mnemonic 
‘double, drop or swop’ for them (though | may have re-ordered it) to Jennifer 
Chew. | also owe the following insight to John Mountford.] 

The three rules ‘double, drop or swop’ are mutually exclusive: no more 
than one of them can be applied to the same word at the same point (though 
a word with more than one suffix may exhibit more than one of them). 

The consonant-doubling rule applies only to single stem-final consonant 
letters and when they double before suffixes beginning with a vowel letter. For 
this rule, word-final <y> counts as a vowel letter but still obeys the restriction 
that it never doubles, but medial <u> spelling /w/ (after <q>; there seem to 
be no instances relevant here after <g>) counts as a consonant letter. The rule 
mostly involves the verb endings <-ed, -ing>, but also applies to: 

the adjective suffixes <-able, -est, -y>, as in regrettable, saddest, 
gassy, runny, Starry 

the noun suffixes <-age, -ance, -ation>, as in cribbage, scrummage, 
slippage, stoppage, tonnage; admittance, remittance, riddance; 
cancellation, (with medial <u> = /w/) quittance 

the verb suffix <-en>, in fatten, flatten, gladden, madden, redden, 
sadden 
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the noun and adjective suffix <-er>, as in plodder, sadder 

the noun and adjective suffix <-ery>, as in cattery, jewellery, jobbery, 
lottery, nunnery, piggery, shrubbery, slippery, snobbery, tannery, 
thuggery 

the noun suffix <-ess>, in goddess (only) 

the adjective suffix <-ous>, in libellous, marvellous (only). 


To state the main consonant-doubling rule neatly we need the convention 


that <C> means ‘any single consonant letter’, <C’> means ‘any consonant 


letter except those that never double’ and <V> means ‘any single vowel 


letter’. 


1) 


2) 


3) 


So then the main consonant-doubling rule is: 

In one-syllable words ending <CVC">, double the final consonant letter 
before any suffix beginning with a vowel letter; 

In two-syllable verbs ending <CVC’>, double the final consonant letter 
before any suffix beginning with a vowel letter if the last syllable of the 
stem is stressed or if (in British but not US spelling) the last letter of the 
stem is <I>. 

Otherwise, do not double the final consonant letter. 


The second part of the rule applies where the stem is a two-syllable verb, 


regardless of what part of speech the suffixed form is. 


Examples: 


words formed from one-syllable nouns and adjectives: clubbable, 
fitter, furry, gassy, goddess, mannish, matting, sadder, saddest, 
Skittish, starry, trekkie; (with medial <u> = /w/) quizzable, quizzical 
(even though the doubling makes these words break two other rules - 
see sections 4.4.5-6), quiddity, squaddie 

Extension: pittance has <tt> despite being derived not from a one- 
syllable English word but, via French, from the Latin noun pietas; at no 
stage before entering English did it have <tt>. 

Exception: /adette, which by the main consonant-doubling rule should 
be “laddette - but perhaps the shift of stress to the last syllable makes 
the difference. 

words formed from one-syllable verbs: banned, biddable, fitted, hopping, 
penned, plodder, riddance, riggable (of sails or an election), rotten, 
running, runny, slippage, slippery, starring, stoppage, swimmable, 
whammy, (with medial <u> = /w/) quipped, quittance, squatter 

Partial exception: Although the plural of the noun bus can only be spelt 
buses, the late 20'*-century conversion of this word into a verb caused 
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confusion over what its derived forms should be, so that dictionaries 
now show both buses, busing, bused and busses, bussing, bussed. The 
first set look as though the stem vowel might be pronounced /ju:/ 
rather than /a/, but the second set are all forms of the archaic verb to 
buss meaning ‘to kiss’ - take your pick. 

two-syllable verbs formed from one-syllable adjectives by adding 
<-en>: fatten, flatten, gladden, madden, redden, sadden 

Extension: Several past participles of irregular (‘strong’) one-syllable 
verbs are also formed this way: bitten, (fonbidden, ((mis)be/for/ill-) 
gotten, hidden, ridden, smitten, written. In all of these except (fon bidden 
the stem vowel phoneme changes. On the two-syllable verb forms in this 
bullet point and the previous one see also section 4.3.5. 

two-syllable verbs with stress on last syllable: abetted, abhorrence, 
admittance, allotted, beginning, committal, debarred, demurring, 
deterrence, forgetting, interred, occurred, recurrence, regrettable, 
transmittable; (with medial <u> = /w/) acquittal, equipped, equipping 
Contrast two-syllable verbs with stress on first syllable: coveted, 
focusing, focused, laundered, marketing, merited, targeted - some 
prefer the spellings “focussing, “focussed but the second <s> is 
unnecessary, since it is unlikely in this case (given that the stress 
is on the first syllable) that the spellings with <s> would suggest 
pronunciations with /ju:/ rather than /a/. Also contrast benefited 
(where again the doubling in “benefitted is unnecessary). 

two-syllable verbs ending in -fer stressed on last syllable: conferring, 
deferring, preferred, referral 

Contrast two-syllable verbs ending in -fer stressed on first syllable: 
differed, offering, proffered, sufferance. 

In this category, if the suffix constitutes a separate syllable (<-al, 
-ance, -ing>), whether the <r> doubles or not, /r/-linking occurs - 
see section 3.6 - and the <rr, r> is both a grapheme in its own right 
spelling /r/ and part of a larger grapheme <err, er> spelling /3:, 3/ 
respectively. For dual-functioning see section 7.1. 

two-syllable verbs ending in <I>, British spelling: 

(stress on first syllable) cancellation, counsellor, cudgelling, gambolled, 
labelled, leveller, libellous, marvellous, pedalling, quarrelling, 
signalling, traveller 

(stress on second syllable) (un)controllable, compelled, enrolled, 
excelled, fulfilling, propellor, rebellion 
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Compare US spelling: 

(stress on first syllable) cancelation, counselor, cudgeling, gamboled, 

labeled, leveler, libelous, marvelous, pedaling, quarreling, signaling, 

traveler 

(stress on second syllable) (un)controllable, compelled, enrolled, 

excelled, fulfilling, propellor, rebellion. 

Also contrast three-syllable verbs: (un) paralleled. 

Extension: The two-syllable verbs dial, trial and the three-syllable 

verb initial double the <I> in British spelling, despite having a vowel 

letter and not a consonant letter before the <a>: dialling, trialled, 

initialled (cf. US dialing, trialed, initialed). 

Other extensions: 

— diagrammatic, programmatic have <mm>, for (Greek) etymological 
reasons 

— caravanner, caravanning, wainscotting have <nn, tt>, presumably 
to prevent the third <a> or the <o> in the stems appearing to be 
pronounced /e1, au/ 

— questionnaire has <nn> - but millionaire has <n>. No explanation 
of the difference suggests itself 

Two-syllable verbs that end in the single consonant letter <c> after a 

vowel letter mostly double it to <ck> before <-ed, -ing>: frolicking, 

mimicking, panicking, picnicked, shellacked. This also applies to 

the three-syllable verb bivouacked and, by further extension, to the 

adjectives panicky, rheumaticky (finicky appears to be a stem word). On 

11 March 2013 in a column for the Guardian a Kenyan denied that his 

country would be ‘banana-republicked’. But the principle is not extended 

to the one-syllable verb arc, which has the forms arced, arcing, not 

“arcked, ‘arcking. \f the verb spec meaning ‘draw up a specification’ has 

derived forms they might also be speced, specing rather than “specked, 

“specking (since these might derive from speck). And speccy (‘derogatory 

name for a person who wears spectacles’) belongs to a different group 

of exceptions (see under /k/ spelt <cc> in section 3.7.1). 

Exceptions: conference, deference, preference, reference do not 

have <rr> because the stress has shifted to the first syllable (and 

the vowel in <fer> may be elided - see section 6.10); contrast 

conferring, deferring, preferred, referral; and compare referee, with 

stress shifted to the final syllable, and <r> 
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the verbs infer, transfer do not double the <r> before <-able>: 
inferable, transferable, presumably because there is variation in 
which syllable is stressed: /trains'f3irabal/ vs /'trarnsfrabal/ - for the 
elided vowel in the latter pronunciation see section 6.10 
where a two-syllable compound verb has a monosyllabic verb as its 
last element, consonant-doubling occurs even if the stress is on the 
first syllable: inputting, leapfrogged 
some two-syllable verbs with stress on the first syllable double the 
final consonant to avoid the vowel in the last syllable of the stem 
looking as if it should be pronounced long: formatted, hobnobbing, 
kidnapped, worshipper. 
Oddities: The independent noun chancellor, the noun tranquillity and verb 
tranquillise which are based on a two-syllable adjective, the noun teetotaller 
which is based on a three-syllable adjective, and the seven words coralline, 
crystalline, crystallise, panellist, pupillage, rascally, sibylline which are 
based on two-syllable nouns (so none of these words are based on two- 
syllable verbs), nevertheless have <Il> before endings beginning with a 
vowel letter in British spelling (but not in US spelling: chancelor, tranquility, 
coraline, crystaline, crystalize, panelist, pupilage, rascaly, sibyline) for no 
reason that I can find, except perhaps a mistaken analogy with words based 
on two-syllable verbs. Also, the adjective woollen has <Il> in British spelling 
(but the US spelling is woolen), and woolly has <Il> in both systems. And the 
British spelling of the adjective weaselly has <Il> (US spelling allows both 
weaselly and weasely). 


4.3 Other hints for writing a consonant letter 
double 


4.3.1 Where the two parts of a compound word, or 
an affix and a stem, have adjacent identical 
consonant letters, the consonant letter is 
written double 


If the second part of a compound word begins with the same consonant 
letter as the first part ends in, the consonant letter is written double: 
bathhouse, beachhead, fishhook, glowworm, headdress, hitchhiker, 
penname, slowworm, stepparent, still-life, withhold. As already noted, 
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this list contains some of the few examples where letters which are 
rarely written double do occur double. There are various exceptions, 
the most frequent being grandad which in my opinion should be spelt 
granddad. For others, see section 4.4.7. 

If a suffix begins with the same consonant letter as the stem ends in, 
the consonant letter is written double: actually, keenness, soulless. 

If a prefix ends in the same consonant letter as the stem begins 
with, the consonant letter is written double, for example dissatisfy, 
illegible, immature, innate, irreplaceable, misspell, overrun, pellucid, 
subbranch, transsonic, unnatural. The problem here is knowing which 
word-beginnings are prefixes (this is clear enough with <mis-, over-, 
sub-, trans-, un->, although there are few words beginning miss-, 
overr-, Subb-, transs- or unn-) but otherwise often needs etymological 
knowledge. For example, in announce and assimilate, the doubled 
consonant letters arise historically from assimilation of the Latin 
prefix ad- to the first consonant of the stem, and in pellucid from 
assimilation of the Latin prefix per-, but this is not much help - mostly 
you just have to remember, that is develop a feel for the pattern of, 
which words have a doubled consonant letter in this position and 
which do not. 


4.3.2 Monosyllabic content words with /VC/ 
structure have a double consonant letter: the 
Three-Letter Rule 


The entire list of words to which this ‘rule’ applies is add, all, ass, ebb, egg, 
ell, ill, inn, odd, off (contrast the function words as, in, of), and the name 
Ann, and within this list the doubling in all, ass, ell, ill, off is regular anyway. 
In terms of spelling, this rule also applies to err (contrast the filler word 
er) even though its pronunciation is a single long vowel and contains no 
consonant (in RP). In all, however, where consonant-doubling is concerned, 
this rule seems to apply only to the 12 three-letter words just listed. 

Exceptions: ad (advert(isement)), e/, em, en (old-fashioned printers’ 
terms for sizes of spaces), id. 

This is part of a general tendency in English that content words must be 
spelt with at least three letters, even if they contain only one or two phonemes 
and therefore could be spelt with fewer than three letters. Other examples are: 
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(one-phoneme words) awe, aye, eye, ore, owe (contrast the function 
words /, oh, or) 
(two-phoneme words) bee, buy, bye, hie, high, hoe, low, know, sew, 
SOW, wee (contrast the function words be, by, Hi!, ho (exclamation of 
surprise), lo, no, so, we), plus ate pronounced /et/, bow, car (cf. the 
vehicle name Ka), chi (and this Greek letter name does indeed have 
the alternative spelling ki), die, doe (but the spelling do is pre-empted), 
dough, dye, ewe, far, fee, feo, fie, foe (contrast Fo, as in Fee, Fie, ..., 
Fum - but is Fo a word?), guy, joe, key, knee, lea, lee, lie, lye, mow, 
nigh, pea, pee, pie, poh, quay, roe, row, sea, see, sigh, tea, tee, toe 
(but the spelling to is pre-empted), tow, rue, rye, vie, whoa, woe, yew. 
Some of the positional spelling constraints of English help to maintain 
the three-letter rule. For example, if it were generally acceptable to spell 
word-final /d3/ with <j> then edge could be spelt “ej. This spelling would 
observe the rule against doubling <j>; “ejj would be an even odder spelling 
(the <jj> in hajj, now the accepted spelling of the word for the Muslim 
pilgrimage, reflects the doubled pronunciation of the final consonant in 
Arabic). Some of the common digraphs also help to maintain the three- 
letter rule; for example, ash would consist of two letters if English had a 
one-letter grapheme for /{/. 

Three-phoneme content words containing /ks/ are mostly written 
with two letters, using <x>; ax (in US spelling; contrast British axe), ex 
(contrast the river-name Exe), ox and the Greek letter name xi. However, 
the examples just cited appear to be the only 3-phoneme words in the 
language containing the sequence /ks/. There are about 15 three-phoneme 
words containing /ju:/, which can be spelt <u>, and these words could 
therefore also in theory be written with two letters. However, only the Greek 
letter names mu and nu are written this way, and all the rest are written with 
at least three letters: cue, dew, due, few, hew, hue, lieu, mew (homophone 
of mu), gnu, knew, new (these last three being homophones of nu), pew, 
queue, view. And neither ewe nor you would ever be written “u or “yu (cep 
wen txtng, fcors). 

Other function words spelt with fewer than three letters are a, ah, am, 
an, as, at, er, he, |, if, in, is, it, me, my, of, on, so, to, up, us, we, ye. Do and 
go, despite often being content words, make do with two letters because 
of their other use as auxiliary verbs; in contrast, the function word are has 
three letters even though it could be spelt ar like the filler word (but this 
would make the contracted forms “they’r, ‘we’r, ‘you’r look very odd). 
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Other content words which are spelt with two letters (and are therefore 
exceptions to the three-letter rule) are ma, pa, pi (the Greek letter name 
and numerical constant; contrast pie), ta (‘thanks’; contrast tay), and the 
musical terms do, re, mi, fa, so, Ia, ti. 


4.3.3 Consonant phonemes /b df gk pt 2z/ are 
almost always spelt with double letters before 
final /al/ spelt <-le> where the immediately 
preceding vowel phoneme is short, stressed 
and spelt with a single letter 


Examples: 
babble, dabble, gabble, rabble, scrabble, pebble, dibble, dribble, nibble, 
scribble, bobble, cobble, gobble, hobble, nobble, wobble, bubble, rubble, 
stubble; 
addle, skedaddle, paddle, saddle, staddle, straddle, swaddle, twaddle, 
waddle, meddle, peddle, Biddle, diddle, fiddle, griddle, middle, piddle, riddle, 
twiddle, widdle, coddle, doddle, noddle, toddle, cuddle, (be)fuddle, huddle, 
muddle, puddle; 
baffle, raffle, snaffle, waffle, piffle, riffle, skiffle, sniffle, whiffle, duffle, 
kerfuffle, muffle, ruffle, scuffle, shuffle, snuffle, truffle; 
(be)draggle, gaggle, haggle, raggle-taggle, snaggle, straggle, waggle, giggle, 
jiggle, niggle, wiggle, wriggle, boggle, boondoggle, goggle, hornswoggle, 
joggle, toggle, woggle, juggle, muggle, smuggle, snuggle, struggle; 
cackle, crackle, hackle, (ram)shackle, tackle; freckle, heckle, speckle; fickle, 
mickle, pickle, prickle, sickle, stickle, tickle, trickle; cockle; buckle, chuckle, 
knuckle, muckle, suckle, truckle; 
apple, dapple, grapple, cripple, nipple, ripple, stipple, tipple, topple, supple; 
(em) battle, cattle, prattle, rattle, tattle, wattle, fettle, kettle, mettle, nettle, 
Settle, brittle, little, skittle, spittle, tittle, whittle, bottle, dottle, mottle, pottle, 
throttle, cuttlefish, scuttle, shuttle, Suttle; 
(be) dazzle, frazzle, razzle-dazzle, embezzle, drizzle, fizzle, frizzle, grizzle, 
mizzle, sizzle, swizzle, nozzle, s(c)hemozzle, schnozzle, guzzle, muzzle, 
nuzzle, puzzle. 

Also squabble, quibble, squiggle if the <u> in these words is counted (as 
it should be) as a consonant letter. 

Most of the words in this list belong to the less formal/more Anglo- 
Saxon part of the vocabulary. This rule is one of the only two situations in 
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which <zz> is the regular spelling of /z/, since /z/ is never spelt <s, ss, z> 
in this position. 

Exceptions: chattel, subtle, treble, triple (if these words followed this 
rule they would be spelt ‘chattle, “suttle, “trebble, ‘tripple - and, as shown 
above, there is a surname spelt Suttle). 

Extensions (1): Where the consonant between the vowel and <-le> is 
/s/ itis mainly spelt <st> (see section 3.7.6): nestle, pestle, trestle, wrestle; 
bristle, epistle, gristle, mistle thrush (also spelt missel thrush), thistle, 
Thistlethwaite, Twistleton, whistle; apostle, jostle, Postlethwaite, throstle; 
bustle, hustle, rustle. This extension also applies to mistletoe even though 
the <-le> is not word-final. Sub-exceptions: hassle (but | once received 
an email with this word spelt “hastle, showing the power of the <st> sub- 
rule), tussle which conform to the main rule above, plus muscle, which 
conforms to neither the main rule nor this sub-rule about medial /s/ (nor 
does corpuscle, but since it is stressed on the first syllable it does not fall 
under the main rule), and missel thrush in that spelling. 

Extensions (2): There are also a few words where the other conditions 
are met (consonant preceding final /al/ not in the set /Il, m, n, r/ or in 
the set which never double, vowel preceding that stressed, short and spelt 
with one letter) but the final /al/ is not spelt <-le> which nevertheless 
have the consonant spelt double: chattel, cudgel, duffel, estoppel, fossil, 
glottal, jackal, missal, missel thrush in that spelling, mussel, nickel, offal, 
rebuttal, satchel, tassel, vassal, vessel, wittol and a few words in <-ittal> 
which are derived forms obeying the main consonant-doubling rule: 
(ad quittal, committal, remittal. This list contains the only words, apparently, 
in the entire language with final /al/ preceded by /dg, t{/ spelt double as 
<dg, tch>: cudgel, satchel. 

There are no words following this pattern in which the consonant phoneme 
before the /al/ is /m, n, r/, that is, none spelt <*-mmle, “-nnle, “-rrle> (for 
a possible reason see the end of section 4.4.3) - contrast mammal, pommel, 
pummel, trammel. channel, flannel, fennel, kennel, funnel, runnel, tunnel; 
barrel, sorrel, also quarrel, squirrel if the <u> in these words too is counted 
as a consonant letter. Also, it would be odd if the consonant phoneme before 
the /al/ were /I/ - the only word with a short, stressed vowel followed by 
/\/ followed by /al/ appears to be the obsolete word fallal (‘trinket’). And 
by definition this rule cannot apply to <h, j, q, v, w, x, y> even though, for 
example, axle, hovel could in theory be spelt ‘axxle, “hovvle. 

For the converse of this rule, see sections 4.4.2-3 below. 
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The sets of consonant phonemes to which this rule does or does not apply 
cut across those which are mainly written double or single at the end of one- 
syllable words after a short vowel spelt with one letter - see Table 4.2. 


TABLE 4.2: NON-EQUIVALENCE OF SETS OF CONSONANT PHONEMES 
SPELT DOUBLE IN TWO SITUATIONS 


Consonant phonemes 
which are mainly written 
double at the end of one- 
syllable words after a short 
vowel spelt with one letter 


Consonant phonemes which 
are mainly written single 

at the end of one-syllable 
words after a short vowel 
spelt with one letter 


Consonant phonemes which are 
mainly written double between 


the rule above does not apply 


a stressed short vowel spelt /k f s* z/ /bdgpt/ 
with one letter and word-final 
/al/ spelt <-le> 
Consonant phonemes to which 
/tf d3 lv/ /m n/ 


“Is/ is mainly spelt <ss> word-finally and <st> medially in these circumstances. 


Consonant phonemes to which both rules are irrelevant: /h r wj/ because 
they do not occur word-finally, and /n J 3 8 6/ because they have no one- 
letter word-final spelling. 


4.3.4 More generally, consonant letters are mostly 
written double in the middle of two-syllable 
words where the immediately preceding 
vowel phoneme is short and written with a 
single letter 


Unlike the previous rule, this one applies only to two-syllable words, but 
regardless of which syllable is stressed. Examples and exceptions (none of 
which have final /al/ because of the preceding section) are listed in Table 
4.3, grouped in order of the phonemes in Table 2.1. 

As Table 4.3 shows, /v/ is the only phoneme in this position for which 
one-letter spellings are in the majority - see also section 4.4.3. 
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TABLE 4.3: EXAMPLES OF AND EXCEPTIONS TO THE RULE THAT TWO-SYLLABLE 
WORDS HAVE MEDIAL CONSONANT LETTERS WRITTEN DOUBLE AFTER A SHORT 
VOWEL PHONEME WRITTEN WITH A SINGLE LETTER 


Phoneme Examples Exceptions 

/b/ abbey, abbot, bobbin, cabbage, gibbous, hobbit, | abet, cabin, robin, suburb 
hobby, hubbub, rabbit, ribbon, robber, rubber, 
rubbish, Sabbath, stubborn 

/d/ adduce, buddy, haddock, judder, midden, adit, edit 
sodden, sudden 

/g/ beggar, dagger, haggis, jagged, maggot, vigour 
nugget, ragged, rugged, rugger, sluggish, 
trigger 

/m/ ammo, command, commend, commie, commit, amuse, camel, comet, 
common, commute, gammon, grammar, damage, famine, gamut, 
hammock, hummock, immense, immerse, lemon, premise, promise 
lemma, mummy, slummock, stammer, summer, 
summit, summor(s) 

/n/ annex(e), announce, annoy, banner, bonnet, banish, canard, canon, 
cannon, channel, connect, dinner, fennel, enough, menace, money, 
flannel, funnel, ginnel, innate, kennel, penny, onyx, penance, planet, 
runnel, tenner, tunnel punish, tenor 

/p/ appal, appeal, appear, apply, approve, copper, | epic 
happy, hippy, oppose, puppet, supper, tappet 

/t/ attack, attempt, attend, attract, better, butter, | atom 
button, buttress, ghetto, glitter, jitter(s), latter, 
letter, mattress, rattan, rotten, scatter, tattoo, 
tittup, written 

/r/ arrange, arrest, barrow, berry, borrow, cherish, cherub, larynx, 
burrow, carriage, carrot, carry, cherry, correct, | tarot 
derrick, garrotte, herring, horrid, hurry, lorry, 
marriage, mirror, morrow, parrot, porridge, 
quarry, serrate, sorrow, sorry, squirrel, stirrup, 
terrine, terror, warrant, wherry, worrit, worry 

/z/ none spelt with <zz>, but cf. dessert, dissolve, | none spelt with <z>, but 
hussar, possess, Scissors cf. bosom, busy, closet, 

risen 

/k/ account, hiccup, occur, peccary, soccer, speccy | decade, vacuum 
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TABLE 4.3: EXAMPLES AND EXCEPTIONS OF THE RULE THAT TWO-SYLLABLE WORDS 
HAVE MEDIAL CONSONANT LETTERS WRITTEN DOUBLE AFTER A SHORT VOWEL 
PHONEME WRITTEN WITH A SINGLE LETTER, CONT. 


Phoneme Examples Exceptions 
/f/ affair, affect, affirm, affix, afflict, before, default, defeat, defect, defence, 
afford, affray, affright, affront, defend, defer, defile, defunct - but all 
buffer, boffin, buffoon, caffeine, of these are regular because of the 
chaffinch, chiffchaff, chiffon, coffee, | prefixes. Also, with long vowel in first 
coffer, coffin, differ, diffuse, efface, | syllable, gofer, tofu, tufa, wafer 
effect, effete, effort, gaffer, griffin, 
guffaw, jiffy, muffin, offal, offend, 
offer, office, proffer, puffin, riffraff, 
saffron, scaffold, suffer, suffice, 
suffix, suffrage, suffuse, tiffin, toffee. 
Also, with long vowel in first syllable, 
chauffeu-r/se, coiffure, souffle 
/\/ allay, allege, allot, allow, alloy, balance, chalet, choler, colour, column, 
allude, allure, ally, ballet, balloon, felon, lily, malice, olive, palace, palate, 
billet, bullet, bully, callous, callow, police, salad, salon, scholar, solemn, 
challenge, collage, collapse, collar, talent 
collate, colleague, collect, college, 
collide, collie, collude, ellipse, fellow, 
follow, pallet, pallor, pillar, pollute, 
shallot, silly, swallow, trellis, wallet; 
also billion, brilliant, colliery, million 
if pronounced with two syllables 
/s/ assent, assert, assess, assume, 
blossom, cosset, cussed (/'kas1d/ 
‘stubborn’, gossip, massive, missive, 
posset 
Iv/ bevvy, bower, chivvy (also spelt bevel, carvel, cavil, chervil, chivy (also 
chivy), civvy, divvy, flivver, luvvy/ie, | spelt chivvy), civil, clever, coven, covet, 
navvy, revving, savvy, skivvy devil, ever, evil, frivol, gavel, govern, 
gravel, grovel, hovel, larval, level, 
marvel, naval, navel, never, novel, 
oval, prevent, serval, travel, ravel, 
revel, shovel, shrivel, snivel, swivel 


Many more double consonants in two-syllable words result from affixation 


and the main consonant-doubling rule, e.g. misspelt, misspent, sub-branch, 


fitter, fully, furry, goddess, mannish, matting, sadder, saddest, starry, fitted, 
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hopping, plodder, riddance, running, starring; fatten, flatten, gladden, 
madden, redden, sadden, gotten, bitten, hidden, ridden, smitten, written. 

Extension: This pattern also applies not only to /k/ spelt <ck>, e.g. 
beckon, chicken, cricket, gecko, jacket, pocket, reckon, rocket, sprocket, 
but also to /d3, t{/ spelt <dg, tch>, e.g. badger, bludgeon, budget, budgie, 
codger, fidget, gadget, ledger, midget, todger, widget, butcher, crotchet, 
hatchet, ketchup, kitchen, ratchet, scutcheon, wretched. 

However, most two-syllable words ending in <-ic, -id, -it, -ule> do not 
obey this rule - see section 4.4.6. And by definition this rule too cannot 
apply to <h, j, q, W, X, y>. 

For the converse of this rule, see section 4.4.4 below. 


4.3.5 At the end of one-syllable words where the 
preceding vowel phoneme is short and spelt with 
a single letter the following consonant phonemes 
are mostly written double: /k t{ f &lszv/ 


This generalisation brings together the rules for these consonants stated 
individually in sections 3.7.1-8. 

Examples: back, hick, quick, rock, sack, sick, tick, hutch, itch, match, 
duff, off, badge, bodge, fill, full, shall, buss, fuss, puss, jazz, dove, shove 

Exceptions: hic, roc, sac, sic, tic, much, rich, such, chef, clef, deaf, if, veg, 
Czech, flak, suk, trek, yak, col, gal, gel(both pronunciations and meanings), mil, 
nil, pal, bus, gas, plus, pus, this, thus, us, yes, as, cos (in both pronunciations 
/koz, kos/), has, his, is, was, gov, guy, lav, of, rev, shiv, sov, spiv 


4.4 Hints for not writing consonant letters 
double 


4.4.1 At the end of one-syllable words where the 
preceding vowel phoneme is short and spelt with 
a single letter the following consonant phonemes 
are mostly written single: /b d g mn pt/ 


This generalisation brings together the rules for these consonants stated 
individually in sections 3.5.1-7. 
Examples: rob, bad, dog, jam, run, lap, put 


124 Dictionary of the British English Spelling System 


Exceptions: ebb, add, odd, rudd, Sudd, egg, Ann, inn, Lapp, bott, butt, 
matt, mitt, mutt, putt, watt. 

There appear to be no exceptions ending <-mm>. Six of the exception 
words obey the /VC/ part of the ‘Three-Letter Rule’ - see section 4.3.2. 


4.4.2 When do you not write consonant phonemes 
/bdfgkptz/ with double letters before 
final /al/ spelt <-le>? 


In other words, when does the rule in 4.3.3 above not apply? When any of 

the conditions mentioned there is missing, namely: 
where the immediately preceding vowel is unstressed, e.g. laughable, 
visible and all the other words ending in /abal/ (see section 6.6), plus 
article, carbuncle, multiple, principle, tubercle, ventricle 
where the preceding vowel is short and stressed but spelt with more 
than one letter, namely couple, double, treadle, trouble. These are the 
only exceptions in this category; if they followed the main ‘stressed 
short vowel + single consonant phoneme + /al/’ pattern they would 
be spelt “cupple, “dubble, ‘treddle, ‘trubble 
where the preceding vowel is stressed but long, e.g. able, bamboozle, 
bauble, beadle, beagle, beetle, bible, boodle, bridle, bugle, burble, 
chortle, circle, cradle, curdle, cycle, dawdle, disciple, doodle, eagle, 
fable, feeble, foible, garble, gargle, gurgle, hurtle, idle, inveigle, kirtle, 
ladle, maple, marble, myrtle, needle, noodle, noble, ogle, people, 
poodle, purple, rifle, rouble, scruple, sidle, sparkle, staple, startle, 
Steeple, stifle, table, title, tousle, treacle, trifle, turtle, warble, wheedle 
where there are two consonant phonemes between the stressed, short 
vowel and the /al/, e.g. amble, ample, assemble, bramble, brindle, 
bumble, bundle, candle, cantle, crumble, crumple, dandle, dimple, 
dissemble, dwindle, ensemble, example, fondle, fumble, gamble, gentle, 
grumble, handle, humble, jumble, kindle, mantle, mumble, nimble, 
pimple, ramble, resemble, ramble, rumble, rumple, sample, scramble, 
shambles), simple, spindle, stumble, swindle, temple, thimble, trample, 
tremble, trundle, tumble, uncle, wimple; also the group in /ngal/ spelt 
<-ngle>: angle, bangle, bungle, dangle, dingle, jangle, jingle, jungle, 
mangle, mingle, shingle, single, spangle, strangle, tangle, tingle, 
wangle, wrangle. 

But note (for its relevance to the next section) that all these categories of 

exception still spell final /al/ with <-le>. 
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The rule in section 4.3.3 also mostly does not apply (but see the words 
listed towards the end of section 4.3.3) where the /al/ ending is not written 
<-le>, e.g. pedal, rebel, shekel. 


4.4.3 Digression: When do you not spell final /al/ 
as <-le>? 


Strictly speaking this section does not belong in a chapter on doubled and 
single consonant spellings (logically it belongs under /a/ in section 5.4.7), 
but its relevance will become apparent at the end of this section; and it 
arises pretty directly out of the last paragraph in the previous section, where 
spellings of word-final /al/ other than <-le> are mentioned. 
Three categories where final /al/ is not spelt <-le> have already been 
mentioned: 
The words listed at the end of section 4.3.3 where the preceding 
consonant is spelt double even though the final /al/ is not spelt <-le> 
Words with the ‘short vowel + single consonant phoneme + /al/' 
pattern in which the consonant phoneme is /m, n, r/. These are 
almost all spelt with <-el> even though the consonant is spelt double: 
pommel, pummel, trammel, channel, flannel, fennel, kennel, funnel, 
runnel, tunnel, barrel, quarrel, sorrel, squirrel. Exception: mammal 
Words where the medial consonant is /v/ (this list expands on those 
above): anvil, approval and several other nouns ending in /u:val/ 
spelt <-oval>, (anrival and many other nouns and adjectives ending 
in /atval/ spelt <-ival>, bevel, carnival, carvel, cavil, chervil, civil, 
coeval and various other words ending in <-eval>, devil, dishevel, 
drivel, estival, evil, festival, frivol, gavel, gingival, gravel, grovel, 
hovel, interval, larval, level, marvel, naval, navel, novel, oval, 
(un)ravel, retrieval, revel, serval, shovel, shrivel, snivel, swivel, travel, 
upheaval, valval, weevil. 
This list of words with medial /v/ illustrates very clearly most of the range 
of other spellings for final /al/: <-al, -el, -il, -ol>. (The only ones not 
illustrated are <-ul, -yl>, which are very rare and do not occur with medial 
/v/.) But it also raises the question: can any rules be given for when to use 
each of these six possible spellings of final /al/ other than <-le>? 
(Here | ignore the words where final /al/ is spelt <I>, since there are only 
three words in this set: axolotl, dirndl, shtetl). 
Carney (1994: 346) points out that the following three categories mainly 
have <-al>: 
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nouns formed from verbs: appraisal, approval, arousal, avowal, 
betrothal, dispersal, disposal, espousal, perusal, proposal, recital, 
refusal, renewal, reversal, withdrawal 
adjectives formed from nouns: basal, bridal, brutal, causal, central, 
colloidal, digital, fatal, formal, fugal, homicidal, modal, orbital, oriental, 
primal, spinal, spiral, thermal, tidal, tonal, triumphal, universal 
adjectives based on bound forms: conjugal, dental, final, frugal, 
fungal, glottal, legal, marital, mental, municipal, mural, natal, 
nominal, ordinal, papal, principal, regal, renal, skeletal, vital. 
He also points out that words ending in /akeal, 1kal/ may be spelt <-acle, 
-icle, -ical> - for all of these see section 4.4.6. 
Beyond this the contexts become so specific and any ‘rules’ so 
complicated that it seems simpler to give lists: 
words ending in <-al>: admiral, animal, arsenal, cannibal, caracal, 
coral, crystal, cymbal, dental, dismal, floral, gimbal, hospital, hymnal, 
(im)partial, initial, jackal, journal, legal, lethal, local, madrigal, 
mammal, marshal, martial, medal (cf. meddle), memorial, metal 
(cf. mettle), missal, narwhal, nuptial, offal, opal, parental, pedal (cf. 
peddle), petal, plural, rascal, sacral, sandal, scandal, sepal, several, 
signal, sisal, spatial, substantial, total, vandal, vassal, ventral, vocal 
words ending in <-el>: angel, apparel, babel, betel, bezel, brothel, 
bushel, calomel, camel, cancel, caramel, carpel, chancel, chapel, 
charnel, chisel, cockerel, colonel, corbel, counsel, damsel, diesel, 
doggerel, easel, enamel, evangel, gospel, grapnel, hazel, hostel, kernel, 
label, laurel, libel, lintel, mackerel, mantel, minstrel, model, mongrel, 
morsel, nickel, panel, parcel, pastel, petrel, rebel (noun and verb), 
scalpel, scoundrel, sentinel, shekel, shrapnel, snorkel, sorrel, spandrel, 
timbrel, tinsel, wastrel, weasel, yodel, yokel 
words ending in <-il>: April, basil, council, fossil, gerbil, lentil, nostril, 
pencil, pupil, stencil, tonsil 
words ending in <-ol>: carol, gambol, idol, Mongol, petrol, symbol 
words ending in <-ul>: consul, mogul 
words ending in <-yl>: beryl, (ptero)dactyl, sibyl. 
Some of the words listed in this section ending in <-il, -yl> may be pronounced 
with /1l/ rather than /al/, but very few have this pronunciation consistently. 
Reflecting on my own accent | think | have /1I/ only in anvil, gerbil, nostril 
and (ptero)dactyl; also in idyll and the few compound words ending in -phyll. 
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Why is there such a contrast in the spellings of final /al/ between those 
with <-le> and those with <a, e, i, 0, u, y> followed by <I>? 1 think the 
prime reason for this variation is whether the stem word, when suffixed with 
an ending which begins with a vowel phoneme and adds a syllable, retains 
a schwa vowel before the /I/ or not: where /a/ is not retained, the spelling 
is <-le>, otherwise one of the other possibilities. Consider Table 4.4, which 
is certainly not definitive and where <l>-doubling (in British spelling) is 
ignored, but to which | have yet to find any exceptions. 


TABLE 4.4: SOME CASES WHERE STEM WORDS ENDING IN /al/ DO OR DO NOT 
RETAIN /a/ BEFORE A SUFFIX BEGINNING WITH A VOWEL PHONEME 


with /a/ when suffixed - none spelt without /3/ when suffixed - all spelt 
with <-le> before suffixation with <-le> before suffixation 


cannibalism, hospitalise, journalese, angling, assemblage, babbling, 


mammalian, medallion, metallic, baffling, beagling, bottling, 
pedalling, rascally, scandalous, bristling, burglar, chaplain, 
signalling, vandalism; chortling, coddling, crackling, 
angelic, cancelling, caramelise, cuddling, doubly, drizzling, 
channelling, chiselling, cudgelling, dwindling, embezzler, fiddling, 
evangelise, flannelling, pummelling, gambling, heckler, jangling, 
quarrelling, rebellion, squirrelling, jostling, meddling, muddler, 
tunnelling, yodelling; muffler, multiply (verb or adverb), 
councillor, fossilise, pupillage; nestling, niggling, peddling, 
carolling, gambolling, idolatry, rattling, rippling, rustling, saddler, 
symbolism; smuggler, startling, straggler, 
consulate; tattler, trickling, trifling, visibly, 
beryllium, sibylline whistling, wrestling 


A tiny piece of evidence in favour of my theory might be this. Consider the 
words gambol, gamble; pedal, peddle. As stem words these form two pairs 
of homophones pronounced /'gzmbal, 'pedal/, but when suffixed with /1n/ 
they become (in my accent) two minimal pairs: 


gambolling /‘geambalin/ v. /‘gemblin/ gambling 
pedalling /'pedalin/ v. /‘pedl1n/ peddling 
and the schwa is elided (see section 6.10) only in the words which have final 


/al/ spelt <-le> - or, to put this more phonologically, the <-le> spelling 
occurs only where the schwa is elided. 


128 Dictionary of the British English Spelling System 


4.4.4 When do you not write doublable consonant 
letters double in the middle of two-syllable 
words (other than those ending in /al/)? 


In other words, when does the rule in 4.3.4 above not apply? When either of 

the conditions mentioned there is missing, namely: 
where the preceding vowel is long, e.g. auburn, phoneme, pony - 
this category is very large; bouffant and other polysyllables listed in 
section 4.1.3 are sub-exceptions, having a preceding long vowel, yet 
having their medial consonants spelt double 
where the preceding vowel is short but spelt with more than one letter, 
namely breeches, courage, cousin, flourish, heifer, jealous, meadow, 
nourish, peasant, pheasant, pleasant, ready, steady, weapon, woofer 
pronounced /'wufa/, zeal-ot/ous - this category is very small, probably 
containing only these 16 words; Aussie is a sub-exception, having its 
preceding short vowel spelt with two letters, yet having its medial 
consonant spelt double. Possible additions might seem to be heaven, 
heavy, leaven but medial /v/ is hardly ever spelt <w>. 


4.4.5 The third syllable from the end of a word 
rarely ends in a doubled consonant letter 


Examples and exceptions not arising from affixation (none of which have 
final /al/ because of section 4.4.3) are listed in Table 4.5, grouped in order 
of the phonemes in Table 2.1. 


TABLE 4.5: EXAMPLES OF THE RULE THAT THE THIRD SYLLABLE FROM THE END 
OF A WORD RARELY ENDS IN A DOUBLED CONSONANT LETTER, WITH EXCEPTIONS 
NOT ARISING FROM AFFIXATION 


Many other words could be listed, including almost all of those ending in 
<-ical> (see next section) and all those ending in <-ology>. 


Phoneme | Examples Exceptions not arising from 
affixation 
/b/ abusive, cabaret, (de-)liberate, ebony, | shibboleth 


liberal, liberty, probable, tribunal 


/d/ badinage, judicial, prodigal, tradition 


/g/ bigamy, exogamy, trigamy and many | aggregate, aggressive, doggerel, 
others ziggurat 
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/m/ amateur, criminal, family, nemesis, accommodate, ((un) in) flammable, 
ominous, similar and many others dia/pro-grammatic, immodest 
/n/ analyse, benefit, denizen, draconian, | binnacle, cannibal, perennial, 
economic, general, genital, manacle, | pinnacle, tinnitus, zinnia 
manager, manifest, minimal, 
minimum, minister, obscenity, 
penalise, plenary, sanity 
/p/ capital, popular, supersede and many | apparel, apparent, apprehend, 
others frippery, opposite, opponent 
/t/ cataract, gratitude, platitude, attention, attraction, attribute, 
Strategy and many others battery, petticoat, smattering 
/r/ arena, caravan, character, chariot, arroyo, carrion, corridor, erratic, 
clarify, clarion, coracle, heresy, horrendous, horrible, horrific, 
heroin(e), irony, miracle, oracle, horrify, hurricane, interrogate, 
origin, serious, spiracle scurrilous, serrated, terrible, terrific 
/z/ hesitate, misery, visible and many brassiere, razzmatazz 
others 
/k/ executive, faculty and many others accomplice, accomplish, cockerel, 
impeccable, mackerel, occupy, 
pickerel 
/f/ defeasance, defecate, defensive, affable, afferent, affiance, affluent, 
deference, defiance, deficient, deficit, | buffalo, daffodil, difficult, diffident, 
definite, mafia, safari and a few effable, effendi, efferent, effervesce, 
others efficient, effigy, effloresce, effluent, 
effulgent, effusive, graffiti, ineffable, 
offensive, official, raffia, ruffian, 
suffocate, suffragan, taffeta, tiffany 
/\/ celery, element, elephant, holiday, allegiance, allergy, allocate, allusion, 


military (pronounced /'mulrtri:/, with 
three syllables - see section 6.10), 
pelican, quality, relevant, solitude, 
telephone, vilify 


ballistic, ballyhoo, bellicose, 
beryllium, bullion, bulletin, celluloid, 
collier, collision, colloquy, ebullient, 
fallacy, fallible, gallery, galleon, 
gallivant, gullible, illegal, illicit, 
illusion, illustrate, intelligent, lollilop, 
metallurgic, metallurgy, miscellany, 
mullion, palliate, pellucid, pillion, 
pillory, postillion, raillery, scallywag, 
scullery, scullion, stallion, syllable, 
syllabic, syllabub, syllabus, sylloge; 
also billion, brilliant, million if 
pronounced with three syllables 
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TABLE 4.5: EXAMPLES OF THE RULE THAT THE THIRD SYLLABLE FROM THE END OF 
A WORD RARELY ENDS IN A DOUBLED CONSONANT LETTER, WITH EXCEPTIONS NOT 
ARISING FROM AFFIXATION, CONT. 


Phoneme | Examples Exceptions not arising from affixation 
/s/ complicity, ferocity and dozens | ambassador, assemble, dissemble, 

of other words listed in Table dissension, dissipate, dissolute, 

3.4 dissonance, essential, tessellate 
Iv/ cavity, evidence, government, (none) 


levity, poverty, privacy, trivial 


These lists appear to show that for /f, I/ the balance is the other way - 
doubled spellings outnumber one-letter spellings in this position. 

Some other exceptions do arise from suffixation, e.g. rabbinic, robbery, 
shrubbery, addiction, addictive; communist, settlement, diffusion, officer, 
alliance, hellenic, medallion. 


4.4.6 Doubled consonant letters are very rare 
immediately before the endings <-ic(al), -id, 
-it, -ule> 


Examples: acidic, acoustic, acrobatic, agaric, aquatic, arabic, catholic, 
choleric, clinic(al), comic(al), diagrammatic, ecliptic, economical), elliptic(al, 
erratic, etymological, fanatic, genetic, graphemic, heroic, historic(al), horrific, 
lunatic, lyric(al, medical, metallurgic(al, mimic, phonemic, politic(al/s), 
programmatic, rabbinic(al), radical, rhetoric(al), sonic, sporadic, strategic, 
syllabic, terrific, topic(al, typical, volcanic, acid, arid, avid, fetid, florid, 
frigid, intrepid, placid, rabid, rapid, rigid, solid, stolid, tepid, timid, valid, 
vapid, vivid; credit, davit, deposit, emit, habit, (i)licit, limit, omit, profit, 
spirit, visit, vomit; globule, module, schedule. 

Exceptions: attic, britannic, classical, cyrillic, ferric, gallic, idyllic, 
jurassic, metallic, phallic, philippic, prussic, quizzical, tannic, traffic, triassic, 
tyrannical, flaccid, horrid, pallid, torrid, triffid; commit, hobbit, rabbit, soffit, 
summit, whodunnit, worrit; cellule, ferrule, floccule, gemmule, pinnule. 

Most of the exceptions are instead obeying the rule that preceding short 
vowels in two-syllable words are followed by doubled consonant letters 
(section 4.3.4). 

Though most words ending /1kal/ where the ending is unstressed are 
spelt <-ical>, there are a few exceptions: article, canticle, cubicle, chronicle, 
clavicle, conventicle, curricle, cuticle, fascicle, follicle, icicle, particle, testicle, 
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vehicle, ventricle. And where the /1/ in /'tkal/ is stressed this ending is spelt 
<-ickle> - see section 4.3.3. (See section 6.10 for words ending in /rkliz/ 
spelt <-ically>.) For the few polysyllabic words in which the ending /1k/ is 
not spelt <-ic> see Table 3.3. 

There is also a group of words ending in /akal/ spelt <-acle> which 
need to be mentioned here: barnacle, binnacle, coracle, manacle, miracle, 
obstacle, oracle, pinnacle, receptacle, spectacle, spiracle, tentacle, 
tabernacle. None seem to have pronunciations in /1kal/, and only binnacle, 
pinnacle are exceptions to the rarity of doubled letters before such endings. 


4.4.7 When do you reduce <Il> to <1>? 


There are some stem words which have <Il> when they stand alone, but 
sometimes <I> when they do not. As far as | can tell, this affects only the 
few adjectives ending in <II>, adjectives ending in <-ble> when suffixed to 
become adverbs, and the words all, chill, fill, full, null, pall, roll, skill, stall, 
Still, thrall, till (preposition) and will: 
adjectives ending in <Il> lose an <I> before the adverbial ending 
<-ly>: drolly, dully, fully, shrilly (see also section 4.6.1) 
similarly, adjectives ending in <-ble>, when combining with <-ly> 
to form adverbs, first lose the <e>, then lose an <I>, e.g. probably, 
visibly and not “probablely, ‘visiblely or even “probablly, *visiblly 
allsometimes loses an <I> when it is a prefix: albeit, almighty, almost, 
already, although, altogether, always (but not usually “alright) 
till loses an <I> in until 
chillloses an <I> in chilblain (but not in windchill 
Skill, willlose an <l> before -ful: skilful, wilful 
full loses an <I> in the compound verb fulfil and the derived noun 
fulfilment, and in all the adjectives and nouns ending in -ful, e.g. 
beautiful, handful (but not in craw-full. The noun fulness has two 
spellings 
fill also loses an <I> in the compound verb fulfil and the derived 
noun fulfilment, but because fulfil is a two-syllable verb with stress 
on the second syllable AND ending in <I>, it has <Il> before suffixes 
beginning with a vowel letter (and then is the same in British and US 
spelling): fulfilling, fulfilled (but contrast infill, refill 
five of the other six stems behave like fill when forming prefixed or 
suffixed verbs and nouns derived from them: annul, annulment, appal, 
enrol, enrolment, instil, instiiment (a rare but real word), enthral, 


132 Dictionary of the British English Spelling System 


enthralment, thraldom (also spelt thralldom, for which see also 
section 10.42) 
Stall loses an <I> only in instalment and not in forestall, install, 
installation. The spelling “instal, for example, would suggest the 
pronunciation /'tnstal/, not /1n'sto:1/. 
Most of the words listed above lose an <I> in both British and US spelling, 
but in a few cases the single <I>’s listed above (usually) remain double 
in US spelling: skillful, willful, fulfill, fulfillment, fullness, appall, enroll, 
enrollment, instill, instillment, enthrall, enthrallment, installment. 

Extensions (1): Given ... sixth, seventh, ... sixteen, seventeen... and sixty, 
seventy ..., one might have expected “eightth, “eightteen, “eightty, but these 
are always reduced to eighth, eighteen, eighty. 

Extensions (2): In dispirit-ed/ing, pastime, transpire, <ss> becomes 
<s>, and several compounds of mass (‘religious service’) end in -mas: 
Candlemas, Christmas, Lammas, Martinmas, Michaelmas. But less always 
retains its full spelling as a suffix, e.g. hopeless, useless. 

Extensions (3): The word meaning ‘male grandparent’ should, logically, 
be spelt Granddad but is almost always simplified (incorrectly, in my 
opinion) to Grandad (cf. sections 3.5.5, 9.23 on “Granma), and on 22/4/14 
| came across ‘grandaughter on a birthday card website. 


4.5 Learn the rest 


There are other more detailed tendencies and quasi-regularities. But they 
are complicated to state, and some require knowledge of etymology or 
very close attention to pronunciation; and all have exceptions. So other 
words (and there are many of them) you just have to learn. And, sadly, 
accommodation, with <cc> and <mm> (but <d>), and necessary, with 
<ss> (but <c>), are two of them. 


4.6 Consolation prizes 


4.6.1 Consonant letters are never written triple 


Well, almost never; the only words in English containing three consecutive 
identical consonant letters are said to be Invernessshire and Rossshire 
(though there is also still-life, which has to have the hyphen to make it 
conform to the rule). What this rule is really saying is that (for instance) when 
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adjectives ending in <Il> have -/y added, the resulting adverb is written with 
<Il>, not “<IIl>: e.g. drolly, fully, not “drollly, *fullly. This ensures that the 
separate word fully looks the same as the ending of adverbs derived from 
adjectives in -ful, e.g. beautifully. (In Estonian, which is said to have triple 
consonant phonemes, they are still written with double letters). 


4.6.2 Final <CC> + <e> 


And there is another pattern which is pretty reliable. Where a word ends 
in a short vowel phoneme plus a consonant phoneme and then is written 
with a final <e>, the consonant letter is usually written double. Final 
/short vowel/+<CC+e> is admittedly rare. It occurs mainly in words more 
recently borrowed from French (i.e. after the Great Vowel Shift), with a few 
imports from elsewhere, for example gaffe, bagatelle, fontanelle, gazelle, 
grille, vaudeville, programme, comedienne, grippe, steppe, finesse, impasse, 
largesse, etiquette, gazette, lorgnette, mignonette, omelette, palette, 
toilette, vignette, plus all the recent coinages ending <-ette> or <-ville>, 
e.g. ladette, launderette, dullsville. Braille, giraffe, pouffe and mousse 
can be considered as extensions to the pattern - the preceding vowel 
phonemes are long (in RP) - and carafe is a clear exception, with <f>. It is 
also noticeable that most of the polysyllables in this category have final- 
syllable stress - some exceptions are dullsville, etiquette, omelette, palette, 
programme, vaudeville, all with initial-syllable stress. 

The written pattern <-CC+e> occurs also in barre and bizarre, but here, 
since these words do not end in a /r/ phoneme (in RP), <arre> is a four- 
letter grapheme spelling the long vowel /a:/. Similarly, in parterre, <erre> 
is a four-letter grapheme spelling the diphthong /ea/. But all three words 
conform to the spelling pattern of final <-VCCe>, and the two disyllables 
have final-syllable stress. 


5. The phoneme-grapheme 
correspondences of 
English, 2: Vowels 


5.1 The general picture: the principal 
spellings of English vowel phonemes 


This chapter can be summed up by saying that only five of the vowel 
phonemes of RP /a, e, 0, au, jur/ have highly regular spellings (80%+) 
wherever they occur, while none of the other 15 has a spelling accounting 
for more than 60% of its occurrences (though see section 5.4.3 for the 
possibility that /1/ may also belong in the highly regular group). 

The main regularities for all 20 vowel phonemes, plus /jur/, are 
summarised in Table 5.1, by position in the word. The letter-name 
vowels /eI, ix, al, aU, jur/, plus /ur/, need to be analysed according to 
position within non-final vs final syllables (and then, within final syllables, 
according to two further, crossed dichotomies; see also sections 6.2 and 
6.3), whereas all the rest need to be analysed according to position as initial, 
medial or final phoneme. The phonemes are therefore classified into short 
pure vowels, long pure vowels other than /i:, ux/, diphthongs other than 
/eI, aI, aU/, and the letter-name vowels plus /ur/. 

Amidst the clutter of Table 5.1 various generalisations can be discerned: 
five of the short pure vowels have a predominant spelling in initial 
position (/u/ does not occur in this position, in RP) 
the letter-name vowels and /u:/ have remarkable consistency in non- 
final syllables (with the notable exception of /i:/ in unstressed syllables) 
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the letter-name vowels and /u:/ mostly have split digraph spellings 
in closed final syllables (with the notable exceptions of /i:, ux/ in 
monosyllables) 

there is a minor and scattered pattern before consonant clusters: in 
closed monosyllables the long vowel /a:/ and the letter-name vowels 
/al, aU/ are spelt with the single letters <a, i, o> and the letter-name 
vowel /er/ is spelt <ai> before /nt/; for /a:, er/ this pattern extends 
to closed final syllables of polysyllables 

digraphs with <-y> (<ay, oy>) tend to occur word-finally and to 
alternate with digraphs with <i> (<ai, oi>) elsewhere 

similarly, digraphs with <-w> (<aw, ew, ow>) tend to occur word- 
finally and to alternate with digraphs with <u> (<au, eu, ou>) 
elsewhere 

the biggest muddle is /5:/. 


TABLE 5.1: MAIN SPELLINGS OF THE 20 VOWEL PHONEMES, PLUS /ju:/, 
BY WORD POSITION. 


VOWELS OTHER THAN THE LETTER-NAME VOWELS AND /u:/ 


Vowel Position 
Initial . . 
Medial phoneme Final phoneme 
phoneme 
Short pure vowels 
/xe/ <a> (does not occur) 
/e/ <e> (does not occur) 
/1/ <i>, but <i>, but <a> in word-final (does not occur) 
frequently <e> | unstressed /1d3/, and 
in unstressed frequently <e> in other 
syllables unstressed syllables 
/o/ <o> <o>, but mainly <a> after /w/ | (does not occur) 
/A/ <u>, but there are many examples with <o> (does not occur) 
/v/ (does not <oo> in monosyllables ending | (does not occur) 
occur) in /d, k/, <u> elsewhere 
/a/ <a>, with few <a>, with many exceptions <er>, with many exceptions 
exceptions 


Long pure vowels other than /i:, u:/ 


/ax/ 


<ar>, but <a> before consonant clusters (nothing predominates) 


/31/ 


(very rare) <er>, but <or> after initial /w/ | <er> 
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/o:/ <au, or>, but <au, aw, or, ore>, but mainly <aw, or, ore> 
mainly <a> <a> after /w/ and before /I/ 
before /I/ 
Diphthongs other than /e1, ar, au/ 
/o1/ <oi> <oy> 
/au/ <ou> <ou>, but <ow> before /I, n/ <ow> 
and vowel letters 
/ea/ <air> <ar> <are> 
/1a/ (very rare) <er> <ear> 
/ue/ (so rare and diverse that no generalisations are worthwhile) 


THE LETTER-NAME VOWELS, PLUS /u:/ 


In final syllables 


Nene In non-final Closed Open 
syllables In In In In 
polysyllables | monosyllables | polysyllables | monosyllables 
/et/ <a> <ai> before /nt/, otherwise 
<a.e>, with many exceptions <ay> 
with <ai> 
/ix/ mainly <e>; <e.e>, <ee>, with 
many excep- | with many many <y>, with <ee>, with 
tions with <i> | exceptions exceptions many some 
in unstressed exceptions exceptions 
syllables 
/at/ <i> <i.e> <i> before <y>, with <y>, with more 
consonant a few exceptions 
clusters, <igh> | exceptions, than examples 
before /t/, mostly with 
otherwise <i.e> | <i> 
/au/ <o> <0.e> <o> before <ow> in two- | <ow> 
consonant syllable words 
clusters, after /I, r/, 
otherwise otherwise 
<o0.e> <o> 
/jux/ <u> <u.e> <ue> <ew> 
/ux/ <u> <oo> in <oo> <oo> <ew> 
stressed 
final /'uzn/, 
otherwise 


<u.e> 
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5.2 Order of description 


In sections 5.4-7 | set out the vocalic phoneme-grapheme correspondences 

between RP and British spelling, under the vowel phonemes listed in the 

order in which they appear in Table 5.1. Section 5.4 covers short pure vowels, 
section 5.5 long pure vowels other than /i:, ur/, section 5.6 diphthongs 

other than /e1, aI, au/, and section 5.7 the letter-name vowels plus /u:/. 
Under each vowel phoneme | deal with the spellings in this order: 

1) The basic grapheme. In my opinion, each of the 20 vowel phonemes 
of English, plus /jur/, has a basic grapheme, the one which is most 
frequent and/or seems most natural as its spelling. 

2) Other graphemes which are used to spell that phoneme with reasonable 
frequency. 

3) Oddities, graphemes which are used to spell that phoneme only rarely. 

4) Any 2-phoneme graphemes in which the phoneme occurs. (Almost all 
the 2-phoneme graphemes are also Oddities, but a few belong to the 
main system and are included there). 

5) Any 3-phoneme grapheme in which the phoneme occurs. Both 
3-phoneme graphemes are definitely Oddities. 

Most entries end with Notes, and some have Tables. 

By reasonable frequency here | usually mean at least 9% of the occurrences 
of that phoneme in running text. The reason for setting a generally higher 
criterion for vowel spellings than for consonant spellings (see section 3.2) 
is that vowel spellings are so much more varied. For the choice of 9% see 
in particular />:/ spelt <au, aw> and /u:/ spelt <ew> (sections 5.5.3, 
5.7.6), which definitely have to be considered parts of the main system of 
English spelling; and contrast (at 8%) /A/ spelt <ou>, /3:/ spelt <ear> and 
/>:/ spelt <our> (sections 5.4.5, 5.5.2, 5.5.3), which equally certainly are 
Oddities and not parts of the main system. However, as with consonant 
phonemes, the dividing line cannot be absolute. | have ‘promoted’ four 
infrequent correspondences, /p/ spelt <a> at 6%, /Ia/ spelt <eer> at 8%, 
/ju:/ spelt <ue> - percentage unknown, but clearly very low, and /u:/ spelt 
<ue> at <1% (sections 5.4.4, 5.6.4, 5.7.5-6), to the main system, where 
they obviously belong. In the case of /u:/ spelt <ue> this is largely because 
in the grapheme-phoneme direction the correspondences of <ue> are 
highly regular - see section 10.37. 

Again, the frequencies are Carney’s text frequencies (see section 3.2), 
but for /1, 1a/ | take issue with them, and for /i:/ | dispense with them 
completely - see sections 5.4.3, 5.6.4, 5.7.2. 
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5.3 The main system and the rest 


As for the consonant phonemes, under each vowel phoneme | separate the 
correspondences with graphemes into what | consider to be the main system 
and the rest. The correspondences | include in the main system are those 
which seem to me to operate as part of larger regularities, even though 
pretty rarely as absolute rules. For instance, there is a strong tendency in 
English spelling for the letter-name vowel phonemes /e1, it, aI, au, jur/, plus 
/u:/, to be written with the single vowel letters <a, e, i, 0, u> in non-final 
syllables. Within the main system | include only the correspondences which 
seem to me to form part of these larger regularities. For the vowel phonemes 
these comprise the basic correspondences and the correspondences which 
have reasonable frequency as I’ve defined it above, plus a few with lower 
frequencies which have to be in the main system, but not the 2-phoneme 
graphemes (with a few exceptions), or the 3-phoneme graphemes and 
Oddities. Correspondences which have reasonable frequency are shown in 
9-point type, the rest in 7.5-point. 


5.4 Short pure vowels: /2 €1D AU 3/ 


N.B. Six of the seven short pure vowels do not occur word-finally, but /a/ is 
frequent in that position. 


5.4.1 /#/ as in ash 


Does not occur word-finally. 


THE MAIN SYSTEM 


Basic grapheme <a> 99% e.g. cat 


Other frequent graphemes (none) 


THE REST 

Oddities 1% in total 
<ae> only in Gaelic pronounced /'gzl1k/ 
<ai> only in Laing, plaid, plait 


<al> only in salmon 
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<ei> 


<i> 


2-phoneme graphemes (none) 


5.4.2 /e/ as in end 


only in reveille 


only in absinthe, impasse, ingenu(e), 
lingerie pronounced /'lengari:/ (also 
pronounced /'londgare1/), meringue, 
pince-nez, timbale, timbre 


Does not occur word-finally, and is rare before /n/ - see section 3.8.2. 


THE MAIN SYSTEM 


Basic grapheme <e> 
Other frequent graphemes (none) 
THE REST 
Oddities 

<a> 


84% e.g. pet 


16% in total 


only in any, ate pronounced /et/ (also 
pronounced /ert/), many, Thames, first <a> 
in secretaria-I/t, second <a> in asphalt 
pronounced /'zJfelt/ (also pronounced 
/‘esfelt/), and 

- a few words ending <-ary> with the 
stress two syllables before the <a>, 
€.g. necessary, secretary, pronounced 
/'nesaserit, 'sekrateri:/ (also pronounced 
/'nesasri:, 'sekratriz:/ with no vowel 
phoneme corresponding to the <a> - for 
the elided vowels in this and the next three 
paragraphs see section 6.10) 

- a few adverbs ending <-arily>, e.g. 
militarily, necessarily, primarily, voluntarily 
pronounced /mulr'tertli:, nesa'sertli:, 
prar'mertli:, volan'tertli:/ with the 
<a> stressed (also pronounced either 
/mult'teariliz, nesa'seartli:, prar'meertliz, 
volan'teartli:/ with /ea/ spelt <ar> and 
the <r> also a grapheme in its own right 
spelling /r/ - for dual-functioning see 
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<ae> 


<ai> 


<ay> 
<ea> 
<ei> 


<eo> 


<ie> 
<u> 
2-phoneme graphemes (none) 


3-phoneme grapheme /eks/ 
spelt 
<x> 


NOTES 


section 7.1 - or reduced to four syllables 
as /‘multtrali:, 'nesasrali:, ‘prarmreali:, 

‘volantrali:/ with stress shifted one or two 
syllables forward, again no vowel phoneme 
corresponding to the <a>, and the vowel 
before /li:/ changed from /1/ to /a/) 

- temporary pronounced /'tempareri:/ (also 
pronounced /'temprari:/ with no vowel 
phoneme corresponding to the <o> and the 
<a> now spelling /a/) 

- temporarily pronounced /tempa'rertli:/ 
(also pronounced either /tempea'reartli: / 
with /ea/ spelt <ar> and the <r> alsoa 
grapheme in its own right spelling /r/ - for 
dual-functioning see section 7.1 - or 
reduced to three syllables as /'temprali:/ 
with no vowel phonemes corresponding 
to the <o> or the <a> and the two /r/ 
phonemes reduced to one) 


only in aesthetic pronounced /es'8et1k/ 
(also pronounced /i:s'@ettk/), haemorrhage, 
haemorrhoid 


only in bouillabaisse, said, saith and 
(usually, nowadays) again(st) 


only in says 
6% in about 60 words - see Note 
only in heifer, leisure, seigneur 


only in Geoff(rey), jeopardy, Leonard, 
leopard 


only in friend 


only in burial, bury 


only in X-ray, etc. 


There are only about 60 stem words in which /e/ is spelt <ea>, but no rule 


can be given to identify them, so here is a list: Beaconsfield, treacher-ous/y; 
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bread, breadth, dead, dread, (a)head, lead (the metal, plus derivatives leaded, 
leading), meadow, read (past tense and participle), Reading (Berkshire), 
(already, spread, (instead, steadfast, steady, thread, tread(le); deaf, 
breakfast; dealt, health, jealous, realm, stealth, wealth, zeal-ot/ous; dreamt, 
seamstress; cleanly (adjective, plus derivative cleanliness), cleanse, leant, 
meant, leapt, weapon, (a)breast, peasant, pheasant, pleasant, measure, 
pleasure, treasure, sweat, threat(en); breath, death, feather, heather, 
leather, weather, endeavour, heaven, heavy, leaven and other derivatives 
not listed. In my opinion very little would be lost if all these words were 
instead spelt with <e> - indeed, one spelling reform proposal is that the 
first change should be to spell all occurrences of /e/ with <e> and nothing 
else - but: 

this might be difficult for some of the Oddities above 

bred, led, red, lent, wether would become homonyms of the words 

already so spelt 

various words would have to acquire unfamiliar letters to conform to 

other rules: “tretcher-ous/y, “bredded, “bredding, ‘dredded, *dredding, 

“hedded, “hedding, “ledded, ‘ledding, Redding (so spelt in first map, 

1611), “(al)reddy, “reddying, “reddied, “reddies, “deff, “breckfast, 

*jellous, “zell-ot/ous, “weppon, *swetted, “swetting, “thretten(ed/ing). 


5.4.3 /1/ as in ink 


Does not occur word-finally, in my opinion/version of RP. For doubts about 
the percentages see Notes. 


THE MAIN SYSTEM 


For all these categories see Notes. 


Basic grapheme_ <i> only 61% if word- e.g. sit. Regular in initial 
final /1/ spelt <y> and medial positions 
is allowed, but a lot 
more otherwise 


Other frequent <y> 20% if word-final e.g. bicycle, crystal 
graphemes /1/ spelt <y> is 

allowed, but a lot 

less otherwise 
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<e> only 16% if word- e.g. diocese (first <e>), 

final /1/ spelt <y> England, English, enough, 

is allowed, but alot entirety (both <e>’s), 

more otherwise extreme (first <e>), 
pretty, scavenge (first 
<e>), stupefy, variety. 
Regular in certain 
suffixes 


THE REST 


Oddities at least 4% in total 
- in stressed syllables 
<ee> only in breeches /'britf{1z/ 


<hea> only in forehead pronounced /'forid/ 


<ie> only in sieve 
<o> only in women 
<u> only in business, busy 


- in unstressed syllables 


<a> in about 250 words ending in unstressed word-final 
/1d3/, which is mainly spelt <-age>, e.g. village, plus 
furnace, menace, necklace, octave, orange, signature, 
surface, spinach pronounced /'spinidg/ and second 
<a> in character, palace. See Notes 


<ai> only in bargain, captain, chamberlain, chaplain, 
fountain, mountain, porcelain 


<ee> only in been when unstressed, cheerio /bin, tftri:'jau/ 


<ei> only in counterfeit pronounced /'kaunteafit/ (also 
pronounced /'kauntofi:t/), forfeit, sovereign, surfeit 


<ia> only in carriage, marriage 

<ie> only in (hand/neo kerchief, mischief, mischievous 

<o> only in pigeon (taking <ge> as spelling /d3/; compare 
pidgin) 

<u> only in lettuce, minute (noun /'mintt/, ‘60 seconds’), 


missus 
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<wi> only in housewife (‘sewing kit’, pronounced /'haz1f/) 
2-phoneme grapheme /1z/ only, following an apostrophe, in regular singular and 

spelt irregular plural possessive forms ending in a sibilant 

<s> consonant (/s, z, J, 3, tf, d3/), e.g. Brooks’s (book), 


jazz’s (appeal), Bush’s (government), (the) mirage’s 
(appearance), (the) Church’s (mission), (the) village’s 
(centre), (the) geese’s (cackling). See /z/, section 3.7.8 


NOTES 


Carney (1994: 135, 139, 161, 380, 430) states that, in the version of RP 
which he analysed (and which he and many other phoneticians prefer to call 
SBS, Southern British Standard, and Cruttenden (2014) now dubs GB, General 
British), /1/ does occur word-finally and in that position is mainly spelt <y>. 
His percentages for the spellings of /1/ are based on that analysis. But he 
also points out (especially on pp.134-5, 380) that many (especially younger) 
RP-speakers do not have word-final /1/ in their accents, but (a short version 
of) /ix/ instead. And on page xxii he says that /1/ does not occur in final 
open syllables, thus contradicting most of his other statements on this (cf. 
also his p.56). 

Cruttenden (2014: 97; and cf. p.84), on the other hand, says: ‘Word-final 
unaccented /1/ has now been replaced in all but the oldest GB speakers by 
/it/ ...,¢.g. in copy‘. |agree, and think that children learning to spell English 
are more likely to hear the final phoneme of, say, city as /i:/ rather than /1/, 
and different from the definite /1/ in the first syllable. Similarly, Mines et al. 
(1978, Table A-2, p.237) show that only about 1% of occurrences of /1/ in 
their analysis were word-final (admittedly in the General American accent, 
but here the point is applicable to RP as well). | have therefore not followed 
Carney’s analysis, but Cruttenden’s, and count /1/ as occurring only in initial 
and medial positions. This does mean, unfortunately, that | have not been 
able to use Carney’s percentages (which, oddly, Cruttenden, 2014: 113) 
retains) without reservation, and he does not provide enough information 
to re-calculate them (this would require knowing what proportion of 
<y>-spellings are word-final). This difference in analysing /1/ also entails 
differences in the analysis of and percentages for /i:/ - see section 5.7.2. 

<i> is regular in initial and medial position in both stressed and 
unstressed syllables. 

Exceptions with <e>: The only words in which /1/ is spelt <e> in stressed 
syllables are England, English, pretty and Cecily pronounced /'sistli:/ and 
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therefore as a homophone of Sicily (Cecily is also pronounced /'sesili:/). 

Categories where <e> is the regular spelling of unstressed /1/ are: 
the past tense and past participle verb ending <-ed> spelling /1d/ 
after /t, d/, e.g. ousted, decided. N.B. Carney (1994: 135) says this 
ending is not included in his percentages 
a few adjectives which are derived from or resemble past participles 
but have /1d/ rather than the expected /d, t/, but often with a 
different meaning, e.g. accursed, aged (/‘e1dgid/ ‘elderly’ vs /e1d3d/ 
‘having x years’), beloved (/bi'lavid/ ‘the loved one’ vs /br'lavd/ 
‘adored’), blessed (/'blestd/ ‘holy’ vs /blest/ ‘consecrated’), cragged, 
crooked (/'krukid/ ‘untrustworthy’ vs /krukt/ ‘at an angle’), Crutched 
(Friars), cursed (/'k3:std/ ‘damnable’ vs /k3:st/ ‘swore badly/put a 
hex on’), cussed (/'kasid/ ‘stubborn’ vs /kast/ ‘swore mildly’), deuced, 
dogged (/'dvgid/ ‘persistent’ vs /dogd/ ‘followed’), fixed (/'fiksi1d/ 
‘persistent’ vs /fikst/ ‘mended’), horned (owl), jagged (/'dgegid/ ‘with 
sharp points’ vs /dgjzgd/ past tense of jag), learned (/'l3:n1d/ ‘wise’ vs 
/I3:nd/ regular past tense of learn), (bow/one-)legged, naked, ragged 
(/‘reegid/ ‘torn, exhausted’ vs /regd/ past tense of rag), rugged, 
sacred, supposed (/sa'pauzid/ ‘apparent’ vs /sa'pauzd/ past tense of 
Suppose), wicked, wretched. |In (ac)cursed, blessed, crooked, Crutched, 
cussed, deuced, fixed, wretched, not only does the /1/ surface (see 
section 7.2) but the /t/ voices to /d/ 
the past participle verb ending <-ed> spelling /1d/ before adverbial 
<-ly>, e.g. advisedly, allegedly, assuredly, barefacedly, composedly, 
confusedly, deservedly, determinedly, fixedly, markedly, relaxedly, 
(un)reservedly, supposedly, unabashedly, unashamedly, undisguisedly, 
unrestrainedly. Again, in barefacedly, fixedly, markedly, relaxedly, 
not only does the /1/ surface (see section 7.2) but the /t/ voices to /d/ 
the <ed> element in a very few nouns in <-ness> formed from past 
participles, e.g. preparedness, where not only does the /1/ surface 
(see section 7.2) but also /r/-linking occurs (see section 3.6) and the 
<r> is both part of the grapheme <are> spelling /ea/ and a grapheme 
in its own right spelling /r/. For dual-functioning see section 7.1 
the superlative adjective ending <-est>, e.g. biggest, grandest 
the archaic second and third person singular verb endings <-est, -eth>, 
e.g. gavest, goeth 
the noun plural and third person singular present tense verb endings 
/1z/ sometimes spelt <-es> after /s, z, J, 3, tf, d3/ - see the entries 
for those consonants in sections 3.6.6, 3.6.8, 3.7.3, 3.7.4, 3.6.2, 
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Then 


3.6.4. N.B. Carney (1994: 135) says this ending is not included in his 
percentages either 

the unstressed noun suffixes /1s, l1s, ltt, nis/ spelt <-ess, -less, -let, 
-ness>, e.g. goddess, listless, booklet, madness. There are many nouns 
ending in unstressed /1s/ when it is not a suffix and is therefore 
not spelt <-ess>, e.g. furnace, menace, palace (all of which have 
alternative pronunciations in /as/), diocese (which is exceptional in 
various ways - see under /i:/, section 5.7.2), justice, practice, mortice/ 
mortise, practise, premise/premiss, promise, treatise and all nouns 
ending in /sis/, e.g. crisis. The set of words ending in stressed /1s/, 
all but one spelt with <-iss>, is very small: amiss, bliss, dismiss, diss, 
hiss, kiss, miss, p ss, remiss, Swiss (exception: abyss) 

the unstressed prefixes (Germanic) /b1/ and (Latin) /d1, 1, 1ks/1gz, prt, 
rI/ spelt <be-, de-, e-, ex-, pre-, re->, e.g. before, beholden, decline, 
deliver, effective, efficient, extreme, examine, precede, predict, regale, 
reject 

the ending /iti:/ when the previous letter is <i>, i.e. in anxiety, 
dubiety, gaiety, moiety, notoriety, (im)piety, (im)propriety, sobriety, 
society, variety, plus entirety, naivety, nicety, surety. The last four 
words are exceptions to the general rule that the ending /t1ti:/ is spelt 
<-ety> only when the previous letter is <i>, otherwise <-ity>, e.g. 
nullity, paucity, including cases where this involves <e>-deletion 
or <y>-replacement or -deletion (e.g. scarcity, laity - see sections 
6.4-6); but the more regular spellings “entirity, “naivity, “nicity, “surity 
would look odd, as would “layity. In entirety, surety, /r/-linking occurs 
(see section 3.6) and then the <r> is both part of the graphemes 
<ir, ur> spelling /ata, vua/ and a grapheme in its own right spelling 
/r/. For dual-functioning see section 7.1. The spelling of the ending 
/1tix/ with <e> when the previous letter is <i> is one of the ways in 
which English spelling avoids the sequence <ii>, which appears to be 
tolerated only in alibiing, fasciitis, leylandii (and probably many other 
biological species names), Pompeii, radii, shanghaiing, Shiite, skiing, 
taxiing (all of which have an automatic intervening /j/-glide). 

there are groups of words with /1/ spelt <e> where no rule can be given: 
the ending /1t/ is normally spelt <-it> (e.g. rabbit and about 200 
other words) but is spelt <-et> in, e.g., ashet, brisket, budget, buffet 
(‘strike’), dulcet, facet, fillet, gannet, gullet, nugget, plummet, punnet, 
russet, secret, valet (also pronounced with /e1/) and about 150 other 
words, so all words with both endings just have to be learnt 
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the ending /1far(d)/ is spelt with <e> rather than <i> in just four 
words: liquefy, putrefy, rarefied, stupefy. These endings are normally 
spelt <-if-y/ied> (e.g. nullify, pacified), including cases where this 
involves <e>-deletion or <y>-replacement (e.g. amplify, jollify - see 
sections 6.4-5); liquefy has the alternative spelling /iquify, but the 
more regular spellings “putrify, “rarified, “stupify would look odd 
a ragbag of other words, e.g. allegation, employ, forest, hallelujah, 
integral (when pronounced /'tntigral/, with stress on first syllable; 
also pronounced /in'tegral/, with stress on second syllable), kitchen, 
mannequin, regalia, subject (noun /'sabdgikt/, with stress on first 
syllable; the verb is pronounced /sab'dgekt/), vinegar, first <e> in 
anecdote, antelope, barometer and all the instruments ending in 
<-ometer> (but not kilometre or other compounds of metre), celebrity, 
consecrate, eccentric, ellipse, elope, enamel, integrate, negate, neglect, 
Scavenge, sequential; second <e> in elegant, elephant, elevate, 
peregrine, and many others. 
Initial /1/ spelt <y> is extremely rare, occurring only in the archaic word 
yclept (‘named’), and the names of the plant and essential oil ylang-ylang 
(also spelt ilang-ilang), the type of boat yngling, and the elements ytterbium, 
yttrium and the names Yvette, Yvonne. Only in yngling, yttrium is it stressed. 
Other exceptions with /1/ spelt <y> are all medial and mainly of Greek 
origin. No generalisations seem possible about contexts in which medial /1/ 
is spelt <y> rather than <i>, so here is a list: abyss, acetylene, acronym, 
amethyst, analysis, analytic, aneurysm (also spelt aneurism), antonym, 
apocalypse, apocrypha(l), asphyxiate, beryl, bi/tri-cycle, calyx, cataclysm, 
catalyst, chlamydia, chlorophyll and a few other words ending in -phyll, 
coccyx, cotyledon, crypt(ic), crystal, cyclamen, cygnet, cylinder, cymbal, 
cynic, cyst, (ptero)dactyl, di/tri-ptych, dynasty (first syllable), eponym, 
etymology (second syllable), eucalyptus, glycerine, gryphon, gymkhana, 
gym(nast/ium), gyp, gypsum, gypsy (first syllable), hieroglyph, hydroxyl 
(last syllable), hymn, hypnosis, hypnotise, hypocrisy (first syllable), hypocrite, 
idyll, larynx, lymph, lynch, myriad, metempsychosis, nymph, onyx, oryx, 
oxygen, paralysis, paralytic, paroxysm, pharynx, phylactery (first syllable), 
physics, polygamy (second syllable), polymer, polyp, pygmy (first syllable), 
pyx, rhythm, salicylic, sibyl, strychnine, sybarite, sycamore, sycophant, 
syllabic, syllable, syllabub, syllabus, sylloge, symbol, sympathy, syndicate, 
synonym (first and last syllables), syntax, synthetic, syphilis, tryst, tyranny 
(first syllable). In my opinion, nothing but a source of confusion would be 
lost if all such words were spelt with <i>. 
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There are fairly clear rules for spelling word-final /1d3/. When stressed, 
its regular spelling is <-idge> and there appear to be no exceptions, but 
there are very few words in this set: (a)bridge, fridge, midge, ridge. When 
unstressed, the regular spelling is <-age>, e.g. cabbage, disparage, garage 
pronounced /'gzrid3/, image, mortgage, village and about 250 other words. 
However, here there are several exceptions, with various spellings: carriage, 
college, (ac)knowledge, marriage, ostrich, privilege, sacrilege, sandwich 
pronounced /'seemwid3/ (and many other placenames with this ending - but 
I’m not dealing with placenames), selvedge, spinach pronounced /'spinid3/, 
vestige, plus three words with, confusingly, the regular stressed spelling: 
cartridge, partridge, porridge (and the last of these is even more confusing 
because the alternative spelling porage has the regular unstressed ending). 


5.4.4 /D/ as in ox 


Does not occur word-finally, or in US accents. 


THE MAIN SYSTEM 


Basic grapheme <o> 92% e.g. long 


Other frequent grapheme <a> 6% e.g. squash, wash, what. See 
Notes 


THE REST 


Oddities 2% in total 
<ach> only in yacht 


<au> only in Aussie, Australia, Austria, because 
(also increasingly pronounced, unusually, 
with stressed /2a/), cauliflower, laurel, 
Laurence, sausage, plus a few words also 
pronounced with /2:/: auction, austere, 
caustic, claustrophobi-a/c, hydraulic, 
(bacca) laureate 


<e> in about 20 more recent French loanwords, 
e.g. (the relevant <e>’s are in caps) 
ambiEnce, cliEntele, denouemEnt, détEnte, 
divertissemEnt, Embonpoint, Embouchure, En 
(suite), Enceinte pronounced /pn'sent/ (also 
pronounced /en'setnt/), Enclave pronounced 
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/‘onkletv/ (more often pronounced 
/‘enkletv/), Encore, Ennui, EnsEmble, 
EntEnte, Entourage, Entracte, Entrepreneur, 
Entree, Envelope pronounced /'pnvalaup/ 
(also pronounced /'envalaup/), gEnre, 


rapprochemEnt, rEntier. See Notes 


<eau> only in bureaucracy, bureaucratise 

<ho> only in bonhomie, honest, honour and 
derivatives 

<i> only in lingerie pronounced /'londgare1/ (also 
pronounced /'‘lenzari:/) 

<ou> only in cough, hough, lough, trough 

<ow> only in (ac) knowledge, rowlock 

2-phoneme graphemes (none) 


NOTES 


If we follow Crystal (2012: 131-2) and Upward and Davidson (2011: 176-9), 
‘more recent’ in terms of loanwords from French means after the Great Vowel 
Shift, which began about AD1400 and was complete by about AD1600. 
There is a reasonably strong tendency for /v/ to be spelt <a> after /w/, 
however spelt - see Table 5.2, and cf. />:/, section 5.4.3. The reason for 


putting this correspondence in the main system despite its percentage is 


that Carney will have excluded the two high-frequency function words what, 


was, and these are enough to make this a frequent correspondence. 


TABLE 5.2: SPELLINGS OF /po/ AS <a> AFTER /w/. 


after /w/ spelt <u> after /w/ spelt <w> after /w/ 

- always after /k/ spelt <q> spelt 
<wh> 

(e) quality, quad and derivatives, swab, swaddle, swallow, swamp, swan, what 


quadrille, quaff, quag(mire) (also 
pronounced /'kweg(-)/), quagga 
(also pronounced /'kwega/), qualify 
and derivatives and associates, 
quandary, quant and derivatives, 
quarantine, quarrel, quarry, quash, 
quatrain, squab(ble), squad and 
derivatives, squal-id/or, squander, 
squash, squat 


swap, swash(buckling), swastika, swat, 
swatch, twaddle, wad, waddle, waddy, 
wadi, waffle, waft (also pronounced 

with /2:/), wallah, wallet, wallop, wallow, 
wally, walrus (also pronounced with /3:/), 
wampum, wan, wand, wander, wannabe, 
want, wanton, warrant, warren, warrigal, 
warrior, was, wash, wasp, wassail, wast, 
watch, watt, wattle 
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TABLE 5.2: SPELLINGS OF /p/ AS <a> AFTER /w/, CONT. 


Exceptions (words in which /p/ is spelt <o> not <a> after /w/) 


quod, quondam 


wok, wombat, wonk, wonton, wop, wot 


swop, swot, wobble, wodge, wog, woggle, whop(per) 


Other words in which /p/ is spelt <a> are ambience, bandeau, blancmange 


(second <a>), bouffant, chanterelle, confidant(e), debutante, diamante 


(second <a>), fiance(e), flambe, flambeau, insouciance, jalap (first <a>), 


mange-tout, moustache (now mostly pronounced with /a:/ in RP), nuance, 


scallop (also pronounced /'skelap/), seance, stalwart (first <a>), wrath 


pronounced /rv@/ (also pronounced /rd:6/). Elderly relatives of mine (born 


about 1880) would say /'pvlbat/ (‘Olbat’) when referring to Victoria’s consort. 


For more detail on the absence of /v/ in US accents see Cruttenden 


(2014: 127) and Carney (1994: 59). 


5.4.5 /A/ as in up 


Does not occur word-finally in RP, and does not occur at all in local accents 


of the north of England. 


THE MAIN SYSTEM 


For both categories see Notes. 


Basic grapheme 


Other frequent grapheme 


THE REST 


Oddities 


<u> 


<O> 


<oe> 


<oo> 


<ou> 


63% e.g. dulcimer, up 


27% e.g. above, monk 


10% in total 
only in does(n’t) 
only in blood, flood 


8% only in chough, Colclough pronounced 
/‘kaulklaf/ (also pronounced /'kaukli:/), 
country, couple, couplet, courage, cousin, 
double, doublet, enough, flourish, “hiccough 
(properly spelt hiccup), housewife (‘sewing 
kit’, pronounced /'hazif/), nourish, rough, 
Slough (‘shed skin’), sough, souther-n/ly, 
touch, tough, trouble, young 
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2-phoneme grapheme /wa/ only in once, one 
spelt 
<o> 

NOTES 


Many people from the North of England do not have this phoneme in their 
accents, but retain the earlier /u/ (see next section) in most of the words 
in which RP has /A/. So far from simplifying their task, this presents them 
with three principal ways of spelling /u/: <o, 00, u>. Also, in some northern 
accents some words in which RP has /a/ are pronounced with /p/, e.g. one, 
among, nothing. 
For spelling RP /A/ some regularities can be stated: 

<o> is regular before /8, 6, v/: doth, nothing (is she sweet?); brother, 

mother, other, smother, above, coven, covenant, (dis/re/un)cover, 

covert pronounced /'kav3:t/ (also pronounced /'kuav3:t/), covet(ous), 

covey, dove, glove, govern(or), lovage, lovat, love, Lovell, oven, plover, 

shove, shovel, slovenly, windhover (exceptions: southern, guv) 

<u> is regular before /b, d, g, d3, k, p, J, t{/: e.g. club, hub, public, 

bud, mud, shudder, buggy, juggle, luggage; budget, cudgel, judge; 

buxom, duck, luxury, abrupt, cup, supper, blush, thrush, usher, clutch, 

duchess, much (exceptions: amok, Cadogan, conjure (‘do magic’), 

pinochle, sojourn (also pronounced with /»/), twopence, twopenny; 

country, couple, cousin, double(t), doubloon, touch, trouble). 
Since there are no other useful generalisations it seems best to give a list of 
other words with /a/ spelt <o>: accomplice, accomplish, become, borough, 
colour, colander (also pronounced with /v/), Colombia (second syllable), 
come, comfort(able), comfrey, comfy, company, (en)compass, constable, 
coz, cozen, done, dost, dozen, dromedary, front, frontier, honey, London 
(first syllable), Monday, monetary, money, monger and its compounds, 
mongrel, monk, monkey, Monroe, Montgomery (twice), month, none, onion 
(first syllable), some, somersault, son, sponge, thorough, ton, tonne, tongue, 
won, wonder, worrit, worry. Some words which used to have /a/ in RP now 
have /v/ instead, e.g. combat, comrade, conduit, Coventry. 

/wa/ also has 2-grapheme spellings, e.g. <wo> in wonder. 


5.4.6 /u/ as in pull 


Occurs only medially (in RP), and never before /n/ (in RP). 
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THE MAIN SYSTEM 


For both these categories see Notes. 


Basic grapheme <oo> 64% e.g. hood, look 

Other frequent grapheme <u> 32% e.g. cushion, push 

THE REST 

Oddities 4% in total (probably an underestimate 


because Carney will not have counted could, 
should, would) 


<o> only in bosom (first <o>), wol-f/ves, 
wolfram, wolverine, Wolverhampton, 


woman 
<or> only in worsted (‘cloth’) 
<ou> only in courier, pouffe pronounced /puf/ 


(also pronounced /pu:f/) 
<oul> only in could, should, would 


2-phoneme graphemes (none) 


NOTES 


In RP (as distinct from local accents of the north of England, in which /u/ is 
much more frequent) there are rather few words containing this phoneme, 
perhaps only about 80 stem words, plus a potentially much larger set of 
adjectives and nouns ending in /ful/ spelt <-ful>. 

<oo> spelling /u/ occurs in only about 28 stem words, namely four words 
which have alternative pronunciations with /ur/: food /fud, fu:d/, hoodlum 
/‘hudlam, ‘'hu:dlam/, room /rum, ru:m/, woofer /'wofa, 'wu:fa/ (cf., as 
mentioned under Oddities, pouffe, though with a different grapheme), plus 
Chinook, forsook, foot, gooseberry /‘guzbri:/, hoof (and its plural hooves), 
poof(tery), soot, woof (/wuf/ ‘barking’; contrast woof /wu:f/ ‘weft’), wool 
and monosyllables ending in /d, k/: good, hood (plus its use as a suffix, 
e.g. childhood - but for hoodlum see above), stood, wood (and its derivative 
woodbine); book, brook, cook, crook, hook, look, nook, rook, shook, took 
(exceptions: could, should, would; pud, suk). The high percentage of <oo> 
spellings despite it occurring in so few words is due to some of those words 
having very high frequency. 

<u> is regular everywhere except in the <oo> words and Oddities 
listed above. However, there are only about 57 stem words in this set in 
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RP: ambush, Buddha, buffet /‘bufe1/ (‘food’), bulbul (twice), bull, bullace, 
bullet, bulletin, Bullingdon, bullion, bully, bulrush (first <u>), bulwark (also 
pronounced with /a/), bush, bushel, butch, butcher (but one of the teachers 
of English at my grammar school in the 1950s said /'batfa/), cuckoo, (mea) 
culpa, cushion, cushty, cushy, ebullient (also pronounced with /A/), fulcrum 
(both <u>’s), full, fulmar, fundi (/'‘fundi:/ South and East African English 
for ‘expert/skilled person’/in Britain, a member of the fundamentalist, 
uncompromising wing of the German Green Party; contrast fundi /'fandat/, 
plural of fundus ‘inner corner of organ’), gerenuk, kaput, kibbutz, kukri, 
lungi, lutz, mullah, mush (/mvfJ/, slang for ‘friend’), muslim, Musulman 
(twice), pud, pudding, pull, pullet, pulpit, push, puss, put, putsch, schuss, 
s(cjhtum, shufti, sputnik, sugar, suk, Sunni, thurible, thurifer, thruppence, 
tuk-tuk (twice), umlaut (first <u>), Zumba, plus derivatives including 
Buddhism, bullock, fulfil, fully, fulness, fulsome, and the adjective/noun 
suffix /ful/ spelt <-ful> - there are at least 150 words so formed, e.g. 
beautiful, handful. Unstressed in that suffix and otherwise only in ambush, 
fulcrum (second syllable), fulfil, gerenuk, tuk-tuk (second syllable). 

For elision of the /u/ when /li:/ spelt <-ly> is added to adjectives in 
<-ful> to form adverbs see section 6.10. 


5.4.7 /a/ (the schwa vowel) as in the first sound 
in about 


The most frequent phoneme in spoken English. 

The only short vowel which does occur word-finally, in my opinion/ 
version of RP. 

Occurs only in unstressed syllables, except in the increasingly frequent 
pronunciation of because as /b1'kaz/. 

Rare before /n/ - see section 3.8.2. 

N.B. Where the schwa vowel is part of a diphthong it is dealt with 
elsewhere - see /ea, 13, Ua, aU/, sections 5.6.3-5, 5.7.4. For the so-called 
‘triphthongs’ /ata/ (which I analyse as a two-syllable sequence consisting of 
a diphthong plus schwa) and /auwa, d1ja/ (which | analyse as two-syllable 
sequences consisting of a diphthong plus automatic intervening /w/- or 
/j/-glide plus schwa), see sections 5.7.3, 5.6.2 and 5.6.1 respectively. For 
this treatment of triphthongs, see also Cruttenden (2014: 153). 

For all categories see also Notes, and for further guidance sections 6.7-9. 
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THE MAIN SYSTEM 


Basic grapheme <a> 


Other frequent 
graphemes 


<O> 


<er> 


<e> 


Rare 2-phoneme_ /al/ 


grapheme 


THE REST 


Oddities 


- in medial position 


Spelt 
<-le> 


<ai> 


<anc> 


35% 


19% 


15% 


e.g. about. Regular in initial and medial 
positions. Especially prevalent in initial 
position, where the only exceptions 
appear to be words formed from the 
Latin prefix ob- and its derivatives, 
e.g. obscure, obtuse, occur, offend, but 
medial position is much more variable 


e.g. Burton, obscure 


e.g. alter. Regular in word-final position 
and in the prefixes hyper-, inter-, per-, 
super- when not stressed on <er>. All 
these prefixes permit /r/-linking (see 
section 3.6) before stems beginning 
with a vowel phoneme, e.g. hyper-active, 
interactive, peroxide, supererogatory 


e.g. artery 


only word-final and only in this reversed 
spelling, e.g. able, possible. Though 

not very frequent as a correspondence 
for /a/ this counts as part of the main 
system because of its higher frequency 
as a correspondence for /I/ - see 
section 3.6.5 


12% in total. Of all those listed, only <ar, or, ur> 
occur in both medial and final positions. None 
occur in initial position (see above under Basic 
grapheme, and Notes) 


only in certain, chieftain, coxswain, curtain, 
mainsail (second syllable), topsail, villain 


only in blancmange /bla'monds/ 


<ar> 


<eau> 


<ei> 


<eo> 


<eu> 


<i> 


<ia> 


<io> 
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regular in the suffixes /wad(z)/ spelt 
<-ward(s)>, e.g. afterwards, backwards), 
downwards), forwards), froward, inward, 
leeward, onward, outward, windward, and 
predominant in the ending /ad/ more generally 
- see Notes. Otherwise in an unpredictable 
ragbag of words, e.g. anarchy, awkward, 
bastard, billiards, blackguard pronounced 
/'blegead/ (also pronounced /'blega:d/), 
bombardier, bulwark, coward, custard, dotard, 
gabardine, halyard, innards, lanyard, monarch, 
mustard, niggardly, orchard, scabbard, 
stalwart, steward, vineyard, wizard 


only in bureaucrat(id 
only in foreign 


only in bludgeon, curmudgeon, dudgeon, 
dungeon, gudgeon, luncheon, puncheon, 
(e)scutcheon, smidgeon, sturgeon, surgeon, 
truncheon, widgeon. See Notes 


only in pasteurise pronounced /'parstfaraiz/ 


in a large number of adjectives ending in 

/abal/ spelt <-ible> where the stem without 
the /abal/ mostly does not sound like a real 
word, e.g. possible. See Basic grapheme <a> 
above, Notes and section 6.7. Also in a few 
adverbs ending <-arily> when not stressed on 
the <a>, which becomes elided (see section 
6.10), so that the <i> in <-ily> spells /a/, e.g. 
necessarily, voluntarily pronounced /'nesasrali:, 
‘volantrali:/ (see also under /e/. section 5.4.2) 


only in fuchsia, miniature, parliament. In words 
like crucial, initial and in Christian | count the 
<i> as part of a digraph with the preceding 
consonant letter - see //, t//, sections 3.8.3, 
3.7.2 


only in cushion, fashion, marchioness, 
stanchion. \n words like question, nation, lesion, 
vision, lotion, fusion | count the <i> as part of 
a digraph with the preceding consonant letter - 
see /tJ, J, 3/, sections 3.7.2, 3.8.3-4 
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<oar> 
<oi> 


<or> 


<ou> 


<ow> 


<ua> 


<ur> 


<y> 


only in cupboard, larboard, starboard 
only in connoisseur, porpoise, tortoise 


2% regular medially in prefix /fa/ spelt <for->, 
e.g. forbid, forget, forgive, forsake (but this is a 
very small set); otherwise rare medially, but cf. 
Deptford (and many other placenames with this 
element), Holborn, scissors, stubborn 


regular in adjectives ending in /as/ spelt 
<-ous>, e.g. anxious, famous. Otherwise only 
in camouflage, doubloon, limousine, moustache, 
tambourine, vermouth pronounced /'v3:ma@/ 
(also pronounced /va'mu:@/) 


only in Meadowhall (locally, in Sheffield), 
sorrowful 


regular in unstressed prefix /sab/ spelt 
<sub->, e.g. subdue, subject (verb, pronounced 
/sab'dgekt/), sublime, submerge, submit, 
subside, subsist, substantial, also in nouns 
ending in unstressed /as/ spelt <-us>. 
Otherwise in, e.g., bogus, capitulate, cherub, 
commensurate, congratulate, conjugate, 
glandular, modular, naturist, petulan-t/ce, 
postulant, spatula 


in nouns, only in actuary, estuary, mortuary, 
obituary, sanctuary, statuary, voluptuary, when 
pronounced with /tfari:/ rather than /tfuari:/ 
(see also under /tf{/, section 3.6.2), plus 
casualty, February, victuals /‘kezalti:, 'febrari:, 
‘vitelz/; also often in rapid pronunciation of 
adjectives like actual (see again /t{/, section 
3.6.2), sexual and especially adverbs derived 
from them (see also section 6.10 on elided 
vowels) 


perhaps usual only in Saturday, surprise, but 
there are several words which may have either 
/3:/ (see section 5.5.3) or /a/, e.g. liturgy, 
metallurgy, saturnine, surmise, surmount, 
Surpass, Survey (verb), survive 


only in pyjama(s) 


- in final position 


<ah> 


<ar> 


<ere> 


<eur> 


<or> 
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only in ayah, cheetah, fellah, haggadah, 
hallelujah, Hannah, loofah, messiah, moolah, 
mullah, mynah, pariah, purdah, (maha) rajah, 
Sarah, savannah, verandah, wallah and some 
other very rare words 


only in an unpredictable ragbag of words, e.g. 
altar, beggar, briar, burglar, cedar, cellar, 
cochlear, collar, columnar, curricular, familiar, 
friar, fulmar, globular, jugular, liar, linear, 
lumbar, lunar, molar, nuclear, particular, 
peculiar, pedlar, peninsular, planar, polar, 
popular, regular, scalar, scapular, scholar, 
sugar, titular, vicar, vulgar. Many such words 
permit /r/-linking, e.g. polarise, polarity - see 
section 3.6 


only in were when unstressed 


only in amateur, chauffeur (if stressed on first 
syllable and pronounced /'faufa/), grandeur. 
/r/-linking occurs in amateurish - see section 
3.6 


regular in nouns formed from verbs in <-ate>, 
e.g. administrator, agitator, alternator, 
commentator, creator, curator, dictator, 
elevator, incinerator, insulator, orator, 
spectator, including cases where the verb 

is rare, e.g. aviator, plus groups ending in 
/kta, esa, tta/ spelt <-ctor, -essor, -itor>, e.g. 
actor, conductor, constrictor, detector, reactor, 
aggressor, assessor, compressor, confessor, 
depressor, predecessor, possessor, professor, 
successor, capacitor, depositor, editor, inhibitor. 
Otherwise only in an unpredictable ragbag 

of nouns, e.g. advisor (also spelt advisen, 
camphor, conspirator, conqueror, contributor, 
conveyor, councillor, counsellor, distributor, 
donor, emperor, error, horror, incisor, inventor, 
languor, liquor, metaphor, pallor, pastor, 
phosphor, rotor, sailor, sponsor, squalor, 
stupor, suitor, survivor, terror, tormentor, 
torpor, traitor, tutor, plus <or> in rigor only in 
the Latin phrase rigor mortis 
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<ough> only in borough, thorough 


<our> only in an unpredictable ragbag of words, e.g. 
arbour, ardour, armour, behaviour, candour, 
clamour, clangour, colour, endeavour, favour, 
fervour, flavour, glamour, harbour, honour, 
humour, labour, neighbour, odour, parlour, 
rancour, rigour (but <or> in the Latin phrase 
rigor mortis), rumour, saviour, splendour, 
succour, tumour, valour, vapour, vigour. \n 
many of these words US spelling has <or> 


<re> only in an unpredictable ragbag of words, e.g. 
accoutre, acre, calibre, centre, chancre, fibre, 
goitre, litre, louvre, lucre, lustre, manoeuvre, 
massacre, meagre (contrast eager), mediocre, 
metre and its compounds, e.g. kilometre 
(contrast meter and its compounds, e.g. 
barometer), mitre, ochre, ogre, reconnoitre, 
sabre, saltpetre, sceptre, sepulchre, sombre, 
spectre, theatre, timbre. In many of these words 
US spelling has <er>. None of these words has 
a /r/ phoneme in the final syllable (in RP), but 
when a suffix beginning with a vowel is added, 
some lose /a/ and have /r/-linking instead; 
e.g. centre /'senta/ plus /al/ becomes /'sentral/ 
(central) - see section 3.6. In accoutrement the 
schwa disappears and two phonemes surface: 
/r/ spelt <r> and /1/ represented by the first 
<e> - see section 7.2. In acreage, massacreing, 
ochreous, ogreish /‘etkaridg, 'mzsakerin, 
aukearas, 'augarif/ /r/ also surfaces, but the 
schwa and /r/ seem to be represented by 

<e, r> in reverse order - see again section 7.2. 
Even more difficult to analyse is manoeuvrer 

if pronounced /ma'nu:vara/, where no letter 
seems to spell the first schwa - but, as Godel 
proved, no formal system can be both complete 
and consistent 


<ur> only in augur, femur, langur, lemur, murmur 
(second syllable), sulphur 


<ure> almost all examples of word-final /tfa/ are spelt 
<-ture>, e.g. architecture, capture, caricature, 
conjecture, creature (contrast preacher, 
teacher), culture, curvature, departure (contrast 
archer, marcher), expenditure, feature (again 


The phoneme-grapheme correspondences, 2: Vowels 159 


contrast preacher, teachen, fixture, fracture, 
furniture, future, gesture, juncture, lecture, 
legislature, literature, manufacture, miniature, 
mixture, moisture, nature, nurture (contrast 
lurcher, (re)searcher), pasture, picture (contrast 
pitcher), posture, puncture, rapture, rupture, 
scripture, sculpture, signature, stature, stricture, 
structure, temperature, texture, tincture, torture, 
(ad) venture, vulture. Other examples of final 

/3/ spelt <-ure> include censure, conjure 

(‘do magic tricks’) pronounced /'kandga/, 

figure, injure, leisure, measure, perjure, 
pleasure, pressure, procedure, seizure, tonsure, 
treasure, verdure (cf. verger); also in azure 
pronounced /'x3~, 'e1Za/ (also pronounced 
/‘ezje, ‘e1zja, 'ezjua, 'erzjua/). See Notes 


<yr> only in martyr, satyr, zephyr 
2-phoneme For /ata/ see under /ar/, section 5.7.3 
graphemes spelt <ir, ire, yr, 


yre> and /wata/ 
spelt <oir> 


For /auwa/ see under /au/, section 5.6.2 
spelt <hour, our> 


For /21ja/ see under /1/, section 5.6.1 

spelt <oir> 

/al/ spelt <I> only in axolotl, dirndl, shtetl. See /|/, section 
3.7.5 


/am/ spelt<m> see /m/, section 3.4.4 


/an/ spelt <n> see /n/, section 3.4.5 
/je/ 
(1) spelt <eu> only in aneurism/aneurysm, pasteurise 


pronounced /'pa:stjaraiz/ (also pronounced 
/‘paistfaraiz/) 


(2) spelt <u> frequent in unstressed penultimate syllables 
of words of three or more syllables stressed 
on the antepenultimate syllable, e.g. amulet, 
angular, argument, calculate, chasuble, 
coagulate, contributor, corpuscular, distributor, 
emulate, fabulous, garrulous, immunise, 
inaugural, incubus, insula-r/te, jugular, 
manipulate, muscular, nebulous, particular, 
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penury, populo)us, querulous, regula-r/te, 
scapula(n), scroful-a/ous, scrupulous, stimul- 
ant/ate/us, succubus, succulent, tremulous, 
truculent, vernacular, also in antepenultimate 
syllable of copulation, population with stress 
on following syllable. Where the preceding 
consonant is /d, t/ the sequences /dj, tj/ 
affricate to /d3, t{/ (see sections 3.7.4, 3.7.2 
and cf. pasteurise above), e.g. in (in)credulous, 
fraudulen-ce/t, glandular, modul-e/ar, nodul-e/ 
ar, pendulum, sedulous; century, congratulate, 
fistula, flatulen-ce/t, fortunate, petulan-t/ce, 
postulant, postulate, saturate, spatula, titular 


(3) spelt <ua> in my analysis, only in January, valuable - but 
see the discussion of words with <u, a> under 
<u>, section 10.36 


(4) spelt <ure> only in failure, tenure and azure pronounced 
/‘wezja, 'erzja/ (also pronounced /‘xzjua, 
'eIzjua, 'e39, 'e1Z9/) 
3-phoneme /watoa/ only in choir - one of only two 3-phoneme 


grapheme spelt with a single graphemes in the entire language 
grapheme <oir> 


NOTES 


Under /1a/ in section 5.6.4 you will see that | disagree with Carney’s analysis 
of that phoneme and have therefore re-allocated a large number of words 
to /i:/ plus /j/-glide plus /a/. However, this has not added any graphemes 
to the correspondences for /a/. | have left Carney’s percentages for /a/ 
unchanged on the assumption that the distribution of its correspondences 
within his analysis of /1a/ is broadly similar to that within his analysis of /a/. 

The articles a, the are pronounced /a, 6a/ before consonant phonemes 
in running speech, and sometimes also when pronounced as citation forms 
- and therefore stressed, thus also counting as partial exceptions to the 
rule that /a/ occurs only in unstressed syllables. But they also have the 
alternative citation forms /e1, di:/, which are not exceptions. Other function 
words which have /a/ in running speech, e.g. to, was, were pronounced 
/ta, waz, wa/, are never so pronounced as citation forms, which are instead 
/tur, woz, w3:/. 

The reason for the wide range of spellings for /a/ is that any vowel, 
however spelt for its full pronunciation, can be reduced to the non-distinctive 
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schwa in an unstressed syllable. The default spellings are <a> in initial and 
medial positions and <er> in final position, and some guidance can be 
given for a few major categories, but there are very many words that just 
have to be learnt - see the Oddities above and the various ragbag lists there 
and in these Notes. 


1. Initial position 
Here the hugely predominant spelling is <a>, and this applies both to the 
native English prefix a- (historically derived from on), e.g. in abide, aboard, 
about, ahead, alight, aside, athwart, away, and to derivatives of the Latin 
prefixes ab-, ad-, e.g. in abrupt, abhor, abound, acclaim, accost, accuse, 
acquire, address, adhere, adopt, affirm, aggressive, allure, annul, appear, 
assure, attend, aver, also in some words of other origins, e.g. (Greek) 
anaemia, anathema, aroma. 

The only set of exceptions appears to be words with /a/ spelt <o> inthe 
Latin prefix ob- and its derivatives, e.g. oblige, obscene, obscure, observe, 
obsess, obtain, occasion, occur, offend, official. 


2. Medial position 


Again the default spelling is <a>, though less strongly than in initial 
position. Some patterning can be seen in initial and final word elements, 
but very little otherwise in medial position. 


2.1 Medial position in prefixes/initial elements 


A few guidelines can be given for when a schwa here is not spelt <a>: 
the prefixes /‘harpa, ‘Inta, 'surpa/ are almost always spelt <hyper-, 
inter-, Super-> 
the unstressed prefixes /kan (and related forms), pra, ta/ are spelt 
with <o>, e.g. collect, collide, command, commit(tee), confess, 
connect, connive, connubial, contrast (verb, pronounced /kan'tra:st/), 
corrode, corrupt, procure, produce, profane, profess(or), prolong; 
today, together, tomorrow 
there are several words beginning <chloro-, micro-, mono-, phono-, 
photo-, saxo-> where the stress is on the first syllable and the schwa 
in the second syllable is spelt <o>. 


2.2 Medial position in suffixes /endings/final syllables 


The ending /at/ in many nouns and adjectives is almost always spelt 
<-ate> - see the list of about 90 words under /t/ spelt <te>, section 3.5.7 
(exceptions: chariot, idiot, patriot). 
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For those who say /i1t1v/ for words ending <-itive> the ending /ativ/ is 
always spelt <-ative>. 

The adjectival ending /abal/ is mainly spelt <-able> in words where 
the unsuffixed form sounds like a real word, and mainly <-ible> where it 
doesn’t, but there are numerous exceptions (see section 6.7). 

The adjective-forming suffix /al/ is usually spelt <-al>, e.g. central, 
liberal, loyal, royal; (arboreal, cereal, corporeal, ethereal, funereal, 
marmoreal, sidereal, venereal, congenial, editorial, industrial, jovial, 
managerial, material, memorial, radial, remedial, serial and about 450 
others ending in <-ial>. For the various spellings of final /al/ see also 
sections 4.4.3 and 4.4.2-3. 

There are fairly clear rules for word-final /am/: 

if preceded by /d/ the ending is usually the noun-forming suffix 
/dam/ spelt <-dom>, e.g. kingdom, thrall)\dom, wisdom (exceptions: 
agendum, carborundum, macadam, madam, referendum, sedum, 
tandem) 
if preceded by /z/ the spelling is almost always <sm> (only exception: 
bosom). See under /m/, section 3.4.4 
if preceded by /s/ the ending is usually adjectival /sam/ spelt <-some>, 
e.g. handsome (exceptions: balsam, flotsam, jetsam, besom, blossom, 
buxom, hansom, lissom, ransom, transom) 
otherwise word-final /am/ is usually spelt <-um>, e.g. atrium, 
bacterium, compendium, delirium, gymnasium, medium, opium, 
potassium, radium, stadium, tedium and about 200 others ending 
in <-ium>, plus album, colosseum, linoleum, lyceum, mausoleum, 
maximum, museum, petroleum, rectum (exceptions: algorithm, 
rhythm; amalgam, bantam, bedlam, buckram, gingham, marjoram; 
anthem, emblem, item, problem, stratagem, system, theorem, totem; 
atom, axiom, bottom, custom, fathom, idiom, maelstrom, phantom, 
pogrom, symptom, venom). 

There are fairly clear rules for word-final /as/: 
in adjectives the spelling is almost always <-ous>, e.g. famous and at 
least 2000 others (only exceptions: bogus, emeritus) 
in nouns the spelling is almost always <-us>, e.g. abacus, anus, bonus, 
cactus, Campus, caucus, census, Chorus, circus, citrus, Corpus, Crocus, 
discus, exodus, focus, fungus, genius, genus, hiatus, hippopotamus, 
isthmus, litmus, lotus, octopus, onus, nucleus, radius, rhombus, 
stimulus, surplus, syllabus, Taurus, terminus, tinnitus, virus and 


The phoneme-grapheme correspondences, 2: Vowels 163 


hundreds more (exceptions: (some of which are also pronounced with 
/1s/): furnace, menace, necklace, palace, pinnace, populace, solace, 
surface, terrace; alias, bias, Candlemas, canvas, Christmas, Lammas, 
Martinmas, Michaelmas, carcase/carcass, purchase; canvass, trespass, 
windlass, purpose; porpoise, tortoise) 
there seem to be only five pairs of adjective/noun homophones which 
differ only in the spelling of the /as/ ending: callous/callus, mucous/ 
mucus, populous/populace, rufous/Rufus, venous/ Venus, though of 
course nouns which are rank-shifted to modifier position before other 
nouns retain the <-us> spelling: chorus line, citrus fruit, litmus test 
there seem to be only two words ending /as/ which exist only as verbs: 
embarrass, harass (pronounced /'heras/ rather than the more recent 
/ha'rees/); the spellings of the other few verbs ending /as/ are the 
same as the related nouns: menace; bias, purchase; canvass, trespass; 
chorus, focus. 
The ending /ad/ is usually spelt <-ard>, e.g. awkward, bastard, blackguard 
pronounced /'blegad/ (also pronounced /'blega:d/), coward, custard, 
dotard, halyard, lanyard, mustard, orchard, scabbard, steward, vineyard, 
wizard and see Oddities above for the suffixes /wad(z)/ spelt <-ward(s)> 
(exceptions; method, period, synod). 

The endings /ak, ap/ are usually spelt <-ock, -op>, e.g. bollock, bullock, 
buttock, hassock, hillock, mattock, pillock, rowlock (exception: bulwark); 
bishop, gallop, wallop (exceptions: catsup, chirrup, ketchup, stirrup, syrup). 

In the suffix spelt <-ology>, the schwa after /I/ is always spelt <o>, e.g. 
biology, chronology. 

In the suffix spelt <-ological>, the schwa before the first /I/ is always 
spelt <o>, e.g. biological, chronological, and the second one always <a>. 

The ordinal numeral-forming suffix /a8/ is always spelt <-eth> in 
twentieth, ..., ninetieth. 

Beyond various words listed under the medial Oddities <ai, ei, eo, io, or> 
there are some fairly clear rules for word-final /an/: 

in the various endings pronounced /fan/, all words with <-si'n, -ti*n> 
have <o> for the schwa except Asian, Persian, Prussian, Russian, 
gentian, Titian; all words with <-ci’n> have <a> for the schwa except 
coercion 

the spelling <-on> otherwise occurs mainly in nouns, e.g. bacon, 
Briton, button, carton, chameleon, cotton, galleon, halcyon, matron, 
melodeon, mutton, Odeon, person, piston, siphon/syphon, wanton, 
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plus a set of words in <-ion>: accordion, aphelion, bastion, battalion, 
billion, bullion, carrion, centurion, champion, clarion, collodion, 
companion, criterion, dominion, ganglion, medallion, million, mullion, 
minion, oblivion, onion, opinion, pavilion, perihelion, pinion, rebellion, 
scorpion, scullion, stallion, union 

the irregular past participle ending /an/ (that is, when the ending is 
pronounced as a full syllable, namely after a consonant phoneme) 
is spelt <en>, e.g. (forbidden, bitten, broken, chosen, eaten, fallen, 
forsaken, frozen, (fongiven, hidden, (a)risen, spoken, stolen, swollen, 
(mis)taken, (a)woken, woven, written, even in fossilised forms where 
the stem verb is now regular or its past participle is disused or used 
only adjectively, e.g. beholden, bounden, brazen, cloven, drunken, 
graven, ((mis)be/ill-)gotten, laden, molten, proven, (bed)ridden, riven, 
rotten, (mis)shapen, shaven, shriven, shrunken, smitten, stricken, 
Stridden, striven, thriven, (down)trodden 

<en> also occurs in, e.g.; alien, dozen, even, flaxen, garden, golden, 
happen, heaven, listen, open 

<an> occurs in the noun/adjective ending /an/ in antipodean, 
caesarean, cyclopean, empyrean, epicurean, euclidean, European, 
galilean, Herculean, Jacobean, Linnaean, Manichaean, paean, 
pythagorean, plebeian, barbarian, comedian, grammarian, guardian, 
historian, pedestrian, reptilian, ruffian, thespian and about 200 other 
words ending in <-ian> 

But the endings /ant, ans, ansi:/ have the variant spellings <- ant/- 
ent, -ance/ -ence, -ancy/-ency> - see section 6.8. 


2.3 Otherwise in medial position 


The default spelling is still <a>, e.g. sole <a> in buffalo, dynamo, 
seraph, theatre; first <a> in banana, bravado, farrago, mama, palaver, 
papa, staccato; second <a> in archipelago, balaclava, ballast, breakfast. 
Exceptions: 
with <e> include artery, bolero (/‘bvlarau/ ‘garment’), soviet, first 
<e> in coterie; 
with <o> include abdomen, acrobat, aphrodisiac, bolero (/ba‘learau/, 
‘dance’), cellophane, cenotaph, custody, daffodil, espionage, exodus, 
geographic, iodine, kaolin, lobelia, mandolin, mimeograph, parody, 
police, purpose, ricochet, second, theocratic, violate, vitriol, first 
<o> in creosote, stereophonic, tobacco, second <o> in broccoli, 
choreographic, colloquy, obloquy, rollocking. 
And see again the medial Oddities, above. 
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3. Final position 
The default spelling is <-er>. Examples include: amber, arbiter, auger, 


bitter, brother, cancer, character, chipper, chorister, clover, double- 
decker, eager, ember, knocker, ladder, laager, lager, lever, Londoner, 
lumber, mother, neuter, number, other, oyster, proper, slander, slender, 
sober, thunder, timber, tuber, water, yonder, all comparative adjectives, 
e.g. better, brighter, colder, dearer, easier, happier; most agentive nouns 
formed from one-syllable verbs, e.g. drinker, jumper, killer, roamer, 
runner, speller, viewer (exceptions: actor, sailor); many longer agentive 
nouns where <e/y>-deletion (see sections 6.4, 6.6) applies, e.g. astrologer, 
astronomer, biographer, commuter, diner, geographer, lover, philosopher, 
remembrancer, settler, subscriber, also words with the suffix <-ometer> 
(‘measuring device’), e.g. barometer, thermometer (contrast kilometre 
pronounced /'kilami:ta/ to rhyme with metre and all its other compounds; 
however, kilometre is also pronounced /k1'lomita/ to rhyme with all the 
words ending <-ometer>). 
Exceptions (in addition to the Oddities, above): 

where the schwa vowel is spelt within a 2- or 3-phoneme grapheme: 

see those headings above 

spellings with <e>: genre, macabre (which appear to be the only two 

words where final <-re> is pronounced /ra/ rather than /a/ - contrast 

the Oddities in <-re> listed above), the (unstressed before a word 
beginning with a consonant phoneme), lasagne. There seem to be 
very few words in this set 

spellings with <a>: 

1) agenda, arcana, automata, bacteria, corrigenda, criteria, curricula, 
data, desiderata, ephemera, erotica, errata, esoterica, exotica, 
fauna, flora, fora, insignia, juvenilia, maxima, media, memorabilia, 
memoranda, militaria, millennia, miscellanea, minima, opera, 
phenomena, prolegomena, pudenda, referenda, schemata, 
stigmata, strata, trivia, which etymologically are all Latin or Greek 
neuter plural nouns (though agenda, opera are now always singular 
in English, increasingly data, media are too, and bacteria, criteria 
are often used as singulars by people who don’t know that their 
singulars are bacterium, criterion) 

2) also in a set of exotic loanwords of three or more syllables 
stressed on the penultimate syllable, e.g. abscissa, alfalfa, alpaca, 
amenorrhoea, amoeba, anaconda, angina, angora, antenna, 
arena, aroma, aspidistra, aurora, balaclava, balalaika, ballerina, 
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3) 


4) 


banana, bandanna, belladonna, bonanza, bravura, cadenza, 
candelabra, carcinoma, cassava, cavatina, cedilla, chim(a) 
era, chinchilla, chorea, cicada, concertina, conjunctiva, corona, 
cyclorama, diarrhoea, dilemma, diploma, duenna, emphysema, 
enigma, eureka, extravaganza, farina, felucca, flotilla, glaucoma, 
gonorrhoea, gorilla, granadilla, guerrilla, gymkhana, hacienda, 
hegira, hosanna, hydrangea, hyena, idea, iguana, indaba, influenza, 
koala, lactorrhoea, lacuna, liana, logorrhoea, Madonna, magenta, 
mahatma, manila, mantilla, mazurka, madeira, miasma, mimosa, 
nirvana, (o)edema, ocarina, omega pronounced /avu'mi:ga/, 
operetta, pagoda, panacea, panatella, panorama, pashmina, patella, 
patina pronounced /pea'tiina/, penumbra, persona, pharmacopoeia, 
pianola, placenta, propaganda, protozoa, pyorrhoea, regatta, 
rotunda, rubella, saliva, sarcoma, savanna, scintilla, semolina, 
siesta, sonata, sultana, syringa, tapioca, tiara, toccata, tombola, 
trachea, umbrella, urea, urethra, vagina, Valhalla, vanilla, vendetta, 
veranda, verbena, verruca, viola (/vit'jaula/ ‘musical instrument’) 
also in a set of loanwords of two syllables stressed on the first 
syllable, e.g. alpha, asthma, aura, china, cobra, coda, coma, 
comma, contra, copra, delta, diva, dogma, drama, eczema, era, 
extra, fatwa, gala, gamma, geisha, guava, gurkha, halma, henna, 
hydra, junta, karma, lama, lambda, lava, lemma, libra, llama, 
magma, manna, mantra, nova, okra, ouija, panda, pasha, plasma, 
plaza, polka, pukka, puma, pupa, quagga, quota, rhea, rota, saga, 
schema, skua, soda, sofa, stanza, stigma, tantra, toga, trauma, 
tuba, tufa, tuna, tundra, ultra, villa, visa, vista, viva, vodka, vulva, 
yoga, yucca, zebra, zeugma 

and a further ragbag of words which fit none of those categories, 
e.g. algebra, ammonia, anaemia, anaphora, anathema, apnoea, 
area, azalea, begonia, camellia, camera, chlamydia, cholera, 
cinema, cithara, cochlea, copula, cornea, cupola, dyspnoea, (en) 
cyclopaedia, enema, formula, gondola, harmonica, hernia, 
hysteria, japonica, myopia, nausea, omega pronounced /'aumiga/, 
orchestra, parabola, patina pronounced /'peztina/, peninsula, 
pergola, plethora, primula, replica, retina, salvia, scapula, sciatica, 
scrofula, sepia, stamina, swastika, taffeta, tarantula, tempera, 
utopia, vertebra, viola (/'vaijala/ ‘flower/girl’s name’). 


Any spelling for final /a/ which ends in <r, re> allows /r/-linking, e.g. 


central, 


ethereal, managerial, terrorist, authority, authorial, favourite, 
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calibration, fibrous, leverage, polarise, cigarette (with movement of the 
stress), Vicarious, vulgarity, dictatorial, rigorous (with deletion of the 
<u> from the final syllable of the stem), theatrical, sulphuric, injurious, 
adventurous - see section 3.6. 


5.5 Long pure vowels (other than /i:, u:/): 
/Q: 3: 3:/ 


5.5.1 /a:/ as in aardvark 


THE MAIN SYSTEM 

Basic grapheme <ar> 60% e.g. farther 

Other frequent <a> 34% e.g. father. More frequent in RP 

grapheme than in other accents. Regular 
before consonant clusters, but 
also occurs elsewhere. See Notes 

THE REST 

Oddities 6% in total 


<aa> only in baa, Baal, Graal, kraal, laager, naan, salaam 
<aar> only in aardvark, aardwolf, bazaar, haar 


<a.e> only in final syllables and only in about 30 
(mostly more recent French) loanwords, namely 
ballade, charade, chorale, facade, gouache, 
grave (/gra:v/, ‘French accent’), locale, morale, 
moustache, promenade (noun, ‘seafront path’; the 
verb with the same spelling, ‘walk at leisure’, is 
pronounced with /e1/), rationale, strafe, suave, 
timbale, vase, plus a set of words ending in /a:3/ 
spelt <-age>, e.g. badinage, barrage, camouflage, 
collage, corsage, decalage, décolletage, dressage, 
entourage, espionage, fuselage, garage pronounced 
/'‘gx#ra:3/, massage, menage, mirage, montage, 
triage, sabotage (only exception to final /a:3/ spelt 
<-age>: raj). The <e> in chorale, locale, morale, 
rationale differentiates those words visually from 
choral, local, moral, rational 
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2-phoneme graphemes 


<ah> 


<al> 


<are> 
<arr> 


<arre> 


<arrh> 


<as> 
<at> 
<au> 


<ear> 


<er> 


/wa:/ 


(1) spelt 


<oi> 


(2) spelt 
<oir> 


(3) spelt 
<oire> 


(4) spelt 
<ois> 


only word-final and only in ah, bah, hookah, hoorah, 
kabbalah, Shah, whydah 


only in calf, half, calve(s), halve(s), salve(s) (also 
pronounced /selv(z)/); almond, almoner, alms, 
balm, calm, embalm, malmsey, napalm, palm, 
psalm, qualm 


only in are when stressed 
only in bizarrery, carr, charr, parr 


only in barre, bizarre. /r/-linking occurs in 
bizarrery - see section 3.6 


only in catarrh. /r/-linking occurs in catarrhal - see 
section 3.6 


only in fracas 
only in eclat, entrechat, nougat 
only in aunt, draught, laugh(ter) 


only in hearken (also spelt, more regularly, harken), 
heart, hearth 


only in Berkeley (the town in England), Berkshire, 
Cherwell, clerk, derby, Derby, Ker pronounced /ka:/ 
(also pronounced /k3:/), sergeant 


See also Notes 


only in a few words more recently borrowed from 
French, e.g. bourgeoisie, coiffeu-r/se, coiffure, 
pointe, soiree, toilette 


mainly word-final and only in a very few words more 
recently borrowed from French, namely abattoir, 
boudoir, memoir, reservoir, voussoir, non-finally, 
only in avoirdupois. /r/-linking occurs in memoirist, 
noirish - see section 3.6 


only word-final and only in a very few words more 
recently borrowed from French, namely aide- 
memoire, conservatoire, escritoire, repertoire 


only word-final and only in a very few words more 
recently borrowed from French, namely avoirdupois, 
bourgeois (/z/ surfaces in bourgeoisie - see section 
7.2), chamois (the animal, pronounced /'famwa:/, 
as opposed to the leather made from its skin, 
pronounced /'fzmi:/, the latter also being spelt 
shammy), patois (contrast fatwa) 
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NOTES 


If we follow Crystal (2012: 131-2) and Upward and Davidson (2011: 176-9), 
‘more recent’ in terms of loanwords from French means after the Great Vowel 
Shift, which began about AD1400 and was complete by about AD1600. 
In RP, <a> is regular before consonant clusters, e.g. 
(in monosyllables) aft, craft, graft, haft, raft, shaft (exception: 
draught); chance, dance, glance, lance, prance, trance; ranch, can’t, 
chant, grant, plant, shan’t, slant (exception: aunt); ask, bask, cask, 
flask, mask, task; clasp, gasp, grasp, hasp, rasp; basque, masque; 
blast, cast, caste, fast, last, mast, past, vast (other exception: alms); 
(in final syllables of polysyllables) abaft, advance, enhance, avalanche, 
command, countermand, demand, remand, enchant, bergomask, 
aghast, contrast (noun and verb); 
(in non-final syllables) macabre; padre; after, rafter, example, sample; 
chancel, chancery, revanchis-m/t, commando, slander, answer, basket, 
casket; bastard, caster, castor, disaster, flabbergasted, ghastly, 
master, nasty, pasta (also pronounced with /x/), pasteurise, pastime, 
pastor, pasture, plaster (exceptions: aardvark, aardwolf, laughter, 
malmsey). 
Otherwise, in non-rhotic accents such as RP no rules can be given for where 
/a:/ is spelt <a> rather than <ar>, so here are some lists of words where 
/a:/ spelt <a> occurs: 
several words before medial /6/ spelt <th>, e.g. father, lather, rather 
(exception: farther), and before final /f, s, 8/ spelt <-ff(e)/-ph, -ss, 
-th>, e.g. chaff, distaff, staff, giraffe; cenotaph, and graph and all its 
unsuffixed compounds: auto/cardio/ di/encephalo/epi/mimeo/para/ 
photo/tele/tri-graph (exceptions: calf, half); brass, class, glass, grass, 
pass (exception: arse); bath, path (exception: hearth) 
word-finally in bra, hoopla, Libra, (grand)ma, mama, (grand) pa, papa, 
Schwa, spa (contrast several of the Oddities) 
a large set of words, many of them loanwords, but all ending in a vowel 
phoneme and with stressed /a:/ spelt <a> in the penultimate syllable, 
e.g. armada, avocado, balaclava, banana, blasé (sometimes stressed 
on last syllable), bravado, bravo (sometimes stressed on last syllable), 
cadre, cantata, cascara, cassava, cicada, cinerama, cyclorama, 
desiderata, desperado, drama, farrago, finale, gala, Gestapo, guano, 
guava, gymkhana, iguana, incommunicado, karate, khaki, lager, 
lama, lava, legato, liana, literati, llama, llano, marijuana, mascara, 
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meccano, nazi, pajama, palaver, panorama, pastrami, plaza, praline, 
pro rata, pyjama, safari, saga, salami, schemata, sonata, soprano, 
Staccato, stigmata, strata, sultana, tiara, toccata, tomato, tsunami, 
virago 
a final ragbag: adagio (second <a>), amen, banal (second <a>), 
castle, claque, corral, debacle, fasten, plaque, pajamas (second <a>), 
pyjamas (first <a>). 
Words in which final /a:/ is spelt <-ar> allow /r/-linking, e.g. far away 
/farra'we1/, sometimes with <r>-doubling, e.g. sparring - see section 3.6. 
/wa:/ has the 2-grapheme spelling <ua> in guacamole, guano, guava, 
iguana, suave. 


5.5.2 /3:/ as in earl 


THE MAIN SYSTEM 


For all these categories see Notes. 

Basic grapheme <er> 38% e.g. berth, exert, herd, serf, 
sherd, tern, twerp; defer, 
infer, prefer, refer 


Other frequent graphemes <ir> 18% e.g. birth, fir, whirl 
<or> 17% regular after initial /w/, e.g. 
word 
<ur> 17% e.g. fur, gurgle, surf, turn, 
urn 
THE REST 
Oddities 10% in total 
<ear> 8% never word-final, and only in dearth, 


earl, early, earn, earnest, earth, heard, 
hearse, learn, pearl, rehearse, (re)search, 


yearn 
<ere> only in were when stressed 
<err> in stem words only in err (for which 


see also section 4.3.2), but frequent in 
consonant-doubling before suffixes, e.g. 
preferred (see section 4.2) 
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<eu> only in chauffeuse, coiffeuse, masseuse, 
milieu 
<eur> non-finally, only in secateurs; otherwise 


only word-final and only in about 12 
recent loanwords of French origin, 

e.g. chauffeur (if stressed on second 
syllable, pronounced /fau'f3:/), coiffeur, 
connoisseur, entrepreneur, hauteur, 
masseur, poseur, provocateur, raconteur, 
repetiteur, restaurateur, seigneuranda 
few other rare words 


<irr> only in chirr, shirr, whirr 
<olo> only in colonel 
<our> only medial, and only in adjourn, bourbon 


(‘whiskey’), courteous, courtesy, journal, 
journey, scourge, and tourney pronounced 
/'t3:niz/ (also pronounced /'tuani:) 


<urr> in stem words only in burr, purr, but 
frequent in consonant-doubling before 
suffixes, e.g. furry, demurring, occurred 
(see section 4.2) 


<yr> only in gyrfalcon, myrmidon, myrtle 
<yrrh> only in myrrh 
2-phoneme graphemes (none) 


NOTES 


/3:/ is rare in initial position, and the dozen or so words in which it does 
occur are split between <ear> (earl, early, earn, earnest, earth), <er> 
(ermine, ersatz, erstwhile), <err> (only in ern, <ir> (only in irk) and <ur> 
(urban, urbane, urchin, urge, urgent, urn). 

<er> is the default spelling in medial and final positions. It is regular in 
hyperbole, interpret, superfluous, superlative and other words with initial 
/har'p3:, Int'3:, surp'3:/ stressed on the second syllable and spelt <hyper-, 
inter-, super->; concern, discern, convert, revert and other words with the 
(Latin) elements <cern, vert>; confer, defer, prefer, refer with the (Latin) 
element <-fer>. Other examples: adverse, alert, averse, assert, berth, 
certain, commercial, conserve and derivatives, deserve, desert (/dt1'z3:t/, 
‘abandon’, as opposed to /'dezat/, ‘arid area’), dessert, determine, disperse, 
epergne, eternal, exertion, exterminate, ferment, germ, gherkin, herb, herd, 
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hermit, immerse, inert(ia), jersey, kerchief, kernel, merge, mercenary, nerd, 

observe, perfect, perk, permanent, person, quern, reserve, reverse, Serf, 

serpent, serve, submerge, swerve, tern, terse, thermal, thermos, twerp, 
universe, verger, verse, vertigo. 

<or> is regular after initial /w/ whether spelt <w> or <wh>: 
whortle(berry), word, work, world, worm, worse(n), worship, worst, wort, 
worth(y); otherwise only in attorney. Exceptions: were, whirl, whir(r). 

<ir> is regular in the prefix /'s3:kam/ spelt <circum->, e.g. circumflex, 
circumstance, circumvent, also after /g, kw, 8/, as in gird, gird(le), girder, 
girl, girn, girt, girth (exceptions: gherkin, gurgle, regurgitate); quirk, quirt, 
squirm, squirt; third, thirst, thirteen, thirty (exceptions: thermal, thermos, 

Thursday). Otherwise <ir> occurs in an unpredictable set of words, e.g. 

besmirch, birch, bird, birth, chirp, circle, circus, cirque, dirk, dirt, fir, firm, 

firmament, first, firth, flirt, hirsute, irk, kirtle, mirth, shirk, shirt, sir, skirl, 
skirmish, skirt, smirk, stir, swirl, twirl, Virgo, virtual, virtue, virtuoso, 
virtuous, whir, whirl, zircon. 

<ur> is regular: 

1) in the (Latin) verb element <-cur> (‘run’) as in concur, (dis)cursive, 
cursor, excursus, incursion), occur, recur and more generally after 
/k/: cur, curb, curd, curfew, curl, curlew, curse, curt, curtail, curtain, 
curtsey, curve, scurf, scurvy (exceptions: colonel, courteous, courtesy, 
kerchief, kernel, kersey, kirtle, skirl, skirmish, skirt); 

2) after /b, t(/ and after /s/ in initial syllables of polysyllables: auburn, 
burble, burden, burdock, burgess, burgher, burglar, burgeon, burgoo, 
burgundy, burlap, burlesque, burly, burn, burnet, burnish, burp, bursar, 
burst, disburse, hamburger, laburnum, suburb, church, churlish), 
churn, surface, surfeit, surgeon, surly, surmise, surmount, surpass, 
surplice, surplus, surveillance, survey, survive (exceptions: berg, berth, 
birch, bird, birth, chirp, chirr, concerto if pronounced with /3:/ rather 
than /ea/, serpent). 

Otherwise <ur> occurs in an unpredictable set of words, e.g. absurd, 

appurtenance, blur, blurt, demur, disturb, diurnal, expurgate, frankfurter, 

fur, (re)furbish, furl, furlong, furlough, furnace, furnish, furniture, further, 
furtive, furze, gurgle, hurdle, hurl, hurt, hurtle, insurgent, jodhpurs, 
liturgy-y/ical, lurch(ern), lurk, metallurg-y/ical, murder, murky, murmur 

(first syllable), nasturtium, nocturnal, nurse, nurture, purblind, purchase, 
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purgation, purgatory, purge, purl, purlieu, purloin, purport, purse, pursu-c/it, 
purvey, regurgitate, return, Saturn, saturnine, slur, splurge, spur, spurn, 
spurt, surd, surf, taciturn, Thursday, turban, turbid, turbine, turbot, 
turbulent, turd, turf/ves, turgid, turkey, turmoil, turn, turnip, turquoise, 
turtle, urban, urbane, urchin, urge, urgent, urn. In some words where <ur> 
is not stressed it may be reuced to /a/, e.g. purport, pursu-e/it, surpass. 

Words in which final /3:/ is spelt with a grapheme which includes final 
<-r> allow /r/-linking (see section 3.6), e.g. murmuring, whirring, purring, 
sometimes with <r>-doubling (see section 4.2), e.g. conferring, occurring, 
demurral. 


5.5.3 /o:/ as in awe 


THE MAIN SYSTEM 


For all these categories see Notes. 


Basic grapheme <or> 25% (with e.g. order, afford, for. See also 
<ore, ar>) Table 5.4 


Other frequent <ore> only word-final, e.g. before, 
graphemes except in compounds of fore-. 
See also Table 5.4 


<ar> regular medially after /w/, e.g. 
ward. See also Table 5.3 


<a> 29% regular before /I/; otherwise 
only in water, waft pronounced 
/wo:ft/, wrath pronounced 


/1918/ 

<au> 9% e.g. autumn, cause; word-final 
only in landau, Nassau. See 
also Table 5.4 

<aw> 9% never before /r/; e.g. awful, 


crawl, paw. See also Table 5.4 
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THE REST 
Oddities 28% in total 
<al> 5% only in balk, calk, chalk, falconer, stalk, talk, 
walk 


<augh> 2% only in aught, caught, daughter, distraught, 
fraught, haughty, (Mod Naught(on), naught, naughty, 
onslaught, slaughter, taught 


<aul> only in baulk, caulk, haulm 


<aur> only in bucentaur, centaur, dinosaur (and the 
names of various dinosaur species, e.g. pterosaun, 


minotaur 
<awe> only in awe and derivatives other than awful 
<oa> only in abroad, broad, broaden 
<oar> 2% only in boar, board, coarse, hoar, hoard, hoarse, 


oar, roar, soar 


<oer> only in Boer pronounced /b9:/ (also pronounced 
/bua/) 
<oor> 3% only in door, floor; also boor, moor, poor, spoor 


if pronounced to rhyme with door, floor 


<orp> only in corps (plural), pronounced /k2:z/ 

<orps> only in corps (singular), pronounced /k3:/ 
<orr> only in abhorred 

<ort> only in mortgage, rapport. /t/ surfaces in 


rapporteur - see section 7.2 


<ough> 6% only in bought, brought, fought, nought, ought, 
(be-)sought, thought, wrought 


<our> 8% only in bourne, court(esan), course, four, mourn, 
pour, source, yours) 


<ou’re> only in you’re. See section A.9 in Appendix A 


2-phoneme graphemes (none) 


NOTES 


Generalisations for /3:/ are weak because it has so many spellings. However, 
some are possible for instances of /3:/ after /w/ and before /|/. <ar> is regular 
in medial position after /w/, however spelt - see Table 5.3 and cf. /p/, section 
5.4.4. There are no words in which /9:/ is spelt <ar> without a preceding /w/. 


TABLE 5.3: 
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SPELLINGS OF /3:/ AS <ar> AFTER /w/. 
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after /w/ spelt <u> - 


also, according to particle 
physicists, quark /kwo:k/ 
(pronounced /kwa:k/ by 
the rest of us) 


swarm, swart(hy), (a)thwart, towards, 
untoward, warble, ward, warden, 
warfarin, warlock, warm, warn, warp, 
wart; also war (only example in final 
position and therefore only one with 
potential /r/-linking (and concomitant 
<r>-doubling - see section 4.2), e.g. 
warring - see section 3.6) 


after /w/ 
always after /k/ spelt after /w/ spelt <w> 

spelt <wh> 
<q> 
quart(an/er/et/ic/ile/z2); award, dwar-f/ves, reward, sward, swarf, | whar-f/ves 


Exceptions (words in which 


/D:/ is not spelt <ar> after /w/) 


quorn, quorum, squaw, 
squawk 


caterwaul, sworn, walk, wall, walnut, 
waltz, water, whorl, worn; also waft, 
walrus if pronounced with /9:/ rather 
than /b/. 

wall, walnut, walrus, waltz instead follow 
the generalisation about /3:/ before /I/, 
next 


<a> is regular in all positions before /I/: 


initial: albeit, alder, alderman, all, almanac (usually pronounced with 
/z/), almighty, almost, already, altar, alter, alternate (with both 
stresses and meanings), although, altogether, always (only exception: 
awl, which is also pre-final - see below) 

medial (except before final /I/): bald, balderdash, baldric, balsam(io, 
balti, falsetto, falter, halt, halter, 
instalment, malt, palfrey, palsy, paltry, psalter, salt, scald, thraldom 


enthralment, falcon, false, 
(also spelt thralldom), walnut, walrus (also pronounced with /v/), 
waltz (exceptions: assault, cauldron, fault, vault) 

pre-final (= medial before final /I/; N.B. This is the only place in my 
entire analysis where | have found it useful to use the term ‘pre-final’): 
appal, ball, call, enthral, fall, gall, hall, pall, small, squall, (fore/in) stall, 
tall, thrall, wall (exceptions: caterwaul, haul, maul, awl, bawl, brawl, 
crawl, drawl, scrawl, shawl, sprawl, trawl, yawl. whorl). As this list 
shows, here /I/ after <a> is mostly spelt <II>, the only exceptions being 
appal, enthral. On variation between <I> and <Il> see also section 4.4.7. 


The only words in which /3:/ is spelt <a> other than before /I|/ are waft if 
pronounced /wo:ft/, water, and wrath if pronounced /r3:8/. 
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Beyond this it is simplest to list spellings of /3:/ in <au, aw, or, ore> - 
see Table 5.4. 

Many other examples of medial /3:/ spelt <or> before /r/ arise from 
suffixation of words ending in <-ore> (e.g. boring), when /r/-linking 
occurs - see section 3.6. In all these suffixed cases and in all the cases 
where medial /3:/ spelt <or> occurs before a vowel, the <r> is both part of 
grapheme <or> spelling /3:/ and a grapheme in its own right spelling /r/ 
(for dual-functioning see section 7.1). However, dual-functioning does not 
apply to />:/ spelt <au> before /r/ since <au> already spells /9:/ without 
the following <r>. 

The only example of <r>-doubling in final />:/ spelt <or> appears to 
be abhorred, where the stress in the stem word is on the last syllable and 
<rr> arises from the main consonant-doubling rule - see section 4.2. In 
abhorrent there is both <r>-doubling and /r/-linking (see section 3.6) but 
the preceding vowel changes to /p/ and <rr> spells only /r/. 

Although | have said above that /3:/ before /r/ is never spelt <aw>, we 
should remember the childish pronunciation of drawing as /‘dro:irin/. 

Words in which final />:/ is spelt with other graphemes which include final 
<r> also allow /r/-linking, e.g. hoary, flooring, pouring - see again section 3.6. 


TABLE 5.4: <au, aw, or, ore> AS SPELLINGS OF /93:/. 


For other spellings of /3:/ see above. 


Initial Medial Final 
<au> (before /r/) (before /r/) only in landau 
aura, aural (also apatosaurus and many other 
pronounced with /au/),| dinosaur names, saurian, 
aureole, aureomycin, Sauropod, taurine, Taurus, 
auricle, auriferous, thesaurus. 
aurochs, aurora. See notes above Table 


See notes above Table 


(not before /r/) (not before /r/) 
aubretia, auburn, applaud, assault, astronaut, 
auction (also bauble, bauxite, caterwaul, 


pronounced with /p/), | caucus, caudal, cauldron, cause, 
audacious, audible, caustic, cauterise, caution, clause, 
audience, audio, audit, | daub, daunt, debauch, exhaust, 
auger, augment, augur,| faucet, fault, faun, fauna, flaunt, 
August, august, auk, flautist, fraud, gaudy, gaunt, 
aumbry, auspic-e/ious, 
austere, authentic, 


The phoneme-grapheme correspondences, 2: Vowels 177 


authority), autis-m/ 
tic, autograph, 
automatic, automobile, 
autonomy and many 
other compounds of 
<aut(o)->, aqutumn, 
auxiliary 


gauntlet, gauze, glaucoma (also 
pronounced with /au/), glaucous, 
haul, haunch, haunt, holocaust, 
hydraulic (also pronounced with 
/0/), inaugurate, jaundice, jaunt, 
juggernaut, laud, launch, launder, 
laundry, marauder, maudlin, 
maul, mausoleum, nausea-a/ous, 
nautical, paucity, paunch, pauper, 
pause, plaudit, plausible, raucous, 
Sauce, saucer, sauna, saunter, 
staunch, taunt, taut, vault, vaunt 


<aw> | awful, awkward, awl, bawd, bawl, brawl, brawny), caw, claw, draw, 

(never | awning crawl, dawdle, dawn, drawl, flaw, gnaw, guffaw, 

before drawn, fawn, gawp, hawk, haw, jackdaw, 

/r/) hawser, lawn, mawkish, pawn, jaw, law, lockjaw, 
prawn, scrawl, scrawny, shawl, macaw, maw, paw, 
shawm, spawn, sprawl, squawk, raw, rickshaw, saw, 
tawdry, tawny, tomahawk, trawl, | seesaw, slaw, squaw, 
trawler, yawl, yawn straw, thaw, yaw 

<or> (before /r/) (before /r/) abhor, cantor, 


oracy, oral, oration, 
orient (noun). See 
notes above Table 


(not before /r/) 

or, orb, orbit, orc, 
orchard, orchestra, 
orchid, ordain, ordeal, 
order, ordinary, 
ordnance, ordure, 
organ, organdie, 
organise, orgasm, 
orgy, ormolu, 
ornament, ornate, 
ornery, ornithology, 
orphan, orthodontist, 
orthodox and many 
other compounds of 
<ortho->, ortolan, orts 


aurora, authorial, borax, chlorine, 
choral, chorus, corporeal (second 
syllable), decorum, dictatorial, 
editorial, euphoria, flora, floral 
(also pronounced with /b/), 
forum, glory, memorial, oratorio 


(third syllable), quorum, variorum. 


See notes above Table 

(not before /r/) 

abort(ion), absorb, absorption, 
adorn, afford, border, borrie), 
cavort, chord, chortle, cohort, 
consort, cord, cork, corm, 

corn, corner, cornice, corporal, 
corporeal (first syllable), 
corporation (first syllable), 
corpse, corset, corvette, disgorge, 
divorce, dork, dormitory, endorse, 
enormous, exorcise, extortion, 


condor, corridor, 
cuspidor, décor, for, 
grantor, humidor, 
ichor, lessor, 
matador, mentor, 
mortgagor, nor, or, 
praetor, quaestor, 
realtor, tor, 
toreador, vendor 
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TABLE 5.4: <au, aw, or, ore> AS SPELLINGS OF /3:/. CONT. 


Initial 


Medial 


Final 


force, forfeit, forge, fork, forlorn, 
form, forsythia, fort, forth, 
fortune, gorge, gormandise, 
gormless, gorse, horde, hormone, 
horn, hornet, horse, horticulture, 
important, inform, lord, lorgnette, 
morbid, mordant, morganatic, 
morgue, morning, morphine, 
morse, morsel, mortal, mortar, 
nork, normal, north(-ern/ly), 
perform, platform, porcelain, 
porch, porcupine, pork, porpoise, 
porphyry, portion, portico, portrait, 
record, remorse, report, resort, 
scorch, scorn, Scorpio, scorpion, 
shorn, short, snorkel, sorcerer, 
sordid, sorghum, sort, sport, stork, 
Storm, suborn, support, sword, 
sworn, thorn, torc, torch, torment, 
torn, tornado, torpedo, torpid, 
torque, torsion, torso, tort, tortoise, 
torture, uniform, vortex, worn 


<ore> 


(occurs only in ore, 
which | classify as 
final) 


only in compounds of fore-, of 
which there are 60+ 


adore, albacore, before, 
bore, carnivore, chore, 
claymore, commodore, 
core, deplore, encore, 
explore, fore, furore, 
galore, gore, herbivore, 
ignore, implore, lore, 
more, omnivore, ore, 
pinafore, pore, score, 
semaphore, shore, snore, 
sophomore, sore, spore, 
stevedore, store, swore, 
sycamore, therefore, 
tore, whore, wore, yore 
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5.6 Diphthongs (other than /el, al, 9U/): 
/d1 aU €9 Id Ud/ 


5.6.1 /d1/ as in oyster 


THE MAIN SYSTEM 


Basic grapheme <oi> 61% — e.g. boil. Regular in initial and medial 
positions. Never word-final (except in 
the Greek phrase hoi polloi) 


Other frequent <oy> 39% e.g. boy. Regular in word-final position; 


grapheme rare elsewhere, but see Notes 
THE REST 

Oddity <aw> only in lawyer, sawyer 

2-phoneme grapheme /otja/ only in coir /'k>1ja/ 

(counting the automatic spelt 

/j/-glide as part of the first <oir> 

phoneme) 

NOTES 


<oy> is regular in word-final position, <oi> elsewhere. Exceptions: 
<oi> word-finally: only in hoi polloi 
<oy> non-finally: only in arroyo, boycott, coypu, foyer pronounced 
/‘fatja/ (also pronounced /'fwaijet, 'fotjet/), gargoyle, groyne, hoyden, 
loyal, oyster (only occurrence in initial position), royal, soya, voyage. 
In arroyo, foyer pronounced /'foija/, loyal, royal, soya, voyage the 
<y> is both part of <oy> spelling /21/ and also a grapheme in its own 
right spelling /j/. For dual-functioning see section 7.1. 
coiris the only word in the language with /51ja/ spelt with the single grapheme 
<oir>.Word-final /21ja/ also has the 2-grapheme spellings <oya, oyer> only 
in soya, foyer pronounced /'foija/, the first word of the name of the ancient 
court known as Oyer and Terminer, and coyer, comparative of coy (‘Had we 
but world enough and time...’), and the 3-grapheme spelling <-awyer> only 
in lawyer, sawyer /'loija, 's1ja/. Effectively, therefore, all the example words 
mentioned so far in this paragraph rhyme. Medially, />1ja/ occurs in loyal, 
royal and their derivatives, and possibly nowhere else. It is also noticeable 
that within this (tiny) set of words, only coir itself does not contain <y>. 
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5.6.2 /au/ as in ouch 


THE MAIN SYSTEM 
Basic graphemes <ou> 93% (with word- e.g. about 
final <ow>) 


word-final e.g. allow 
<ow> 


Other frequent graphemes (none) 


THE REST 
Oddities 7% in total 
<aow> only in miaow 
<au> only in ablaut, faustian, gaucho, 


gauleiter, glaucoma pronounced 
/glau'kauma/ (also pronounced 
/glo:'kauma/), sauerkraut (twice), 
umlaut and the Greek letter name 
tau; also in aural if pronounced 
/‘avural/ to distinguish it from oral 
/‘drral/ 


<ough> only in bough, doughty, drought, 
plough, slough (‘muddy place’) 


pre-consonantal 6% e.g. brown 
<ow> 


2-phoneme grapheme (counting /auwa/ 
the automatic /w/-glide as part 
of the first phoneme) 


(1) spelt <hour> only in hour 


(2) spelt <our> — only in devour, flour, lour, our, 
ours, scour, sour and dour 
pronounced /'dauwa/ (which makes 
it a homophone of dower; dour 
is also pronounced /dua/). These 
words allow /r/-linking, e.g. floury, 
scouring - see section 3.6. Also see 
Notes 
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NOTES 


<ow> is regular: 
in word-final position, e.g. cow (only exceptions: thou /Oau/, archaic 
second person singular subject pronoun, thou /@au/, ‘one thousandth 
of an inch/a thousand pounds/dollars’) 
before a schwa vowel spelt with a vowel letter or digraph, i.e. only in 
bowel, dowel, rowel, towel, trowel, vowel; bower, cower, dower, flower, 
glower, power, shower, tower, coward, dowager (no exceptions), plus 
howitzer with /1/ and prowess with /e/; this would also cover rowan in 
its Scottish pronunciation /‘rauwan/ (/'rauwan/ in England) 
in most words ending in /aul/, namely cowl, fowl, growl, howl, jowl, 
owl, prowl, scowl, yowl (only exception to this sub-pattern: foul) 
in most words ending in /aun/, namely brown, crown, down, drown, frown, 
gown, renown, town (only exceptions to this sub-pattern: (pro)noun). 
<ou> is regular everywhere else. Exceptions (in addition to the Oddities 
above and <ow> subpatterns just listed): chowder, crowd, dowdy, powder, 
rowdy, cowrie, dowry, frowsty; blowsy (contrast blouse), bowser, browse, 
dowse, drowse, drowsy, frowsy. 

/auwa/ also has the 2-grapheme spellings <-ower> in bower, cower, 
dower, flower, glower, power, shower, tower, <owar> in coward and 
<owa> in dowager. All of these words except coward allow /r/-linking, e.g. 
cowering, flowery - see section 3.6. 

In the words with medial <ow> followed by a vowel letter listed above, 
the <w> also represents a /w/-glide between /au/ and the following schwa 
(or /1/ or /e/). In these words, therefore, the <w> is both part of the digraph 
<ow> spelling /au/ and a grapheme in its own right spelling /w/ (for dual- 
functioning see section 7.1). The words within this set ending in <-ower> 
form perfect rhymes with the words ending in <-our> listed above, and 
these too seem to me to have an automatic /w/-glide - but the /w/ is not 
represented in the spelling. So for the /w/-glide the alternative spellings 
mean ‘Now you see it, now you don’t ‘. For more on that, see sections 3.8.7 
and 9.0. 


5.6.3 /ed/ as in air 


For the two sets of percentages see Notes. 
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THE MAIN SYSTEM 


Basic grapheme <are> 59% 


(24%) (with 


<ar>) 


Other frequent <ar> 
graphemes 


<air> 28% (12%) 


only word-final, e.g. bare, care, 
fare, flare, hare, pare, stare, tare, 
ware. See Notes 


initially, only in area, Aries; never 
word-final. Regular medially, 
especially where word-final <e> is 
deleted before a suffix beginning 
with a vowel letter, e.g. caring, 
but there are also independent 
examples, e.g. adversarial, 
Aquarius, barium, commissariat, 
garish, gregarious, hilarious, 
malaria), multifarious, nefarious, 
parent, precarious, proletariat, 
Sagittarius, variegated, various, 
vary, and a fairly large set of 
nouns/ adjectives in <-arian>, 
e.g. agrarian, barbarian (2nd 
<ar>), centenarian and other age 
terms, egalitarian, grammarian, 
librarian, proletarian, utilitarian, 
vegetarian, in all these cases 

the <r> also spells /r/ (for 
dual-functioning see section 7.1). 
Medially before a consonant, only 
in scarce, scarcity. See Notes 


regular initially because of air and 
its compounds (see under <aer> 
below); medially, only in fairy, 
prairie (with dual-functioning <r> 


- see section 7.1) and (Scots) bairn, 


cairn, laird, otherwise only word- 
final and only in affair, air (again), 
chair, corsair, debonair, despair, 
eclair, fair, flair, hair, impair, lair, 
mohair, pair, repair, stair 


THE REST 


Oddities 


- non-final 


- final 


<ear> 


<aer> 


<ao> 
<eir> 


<er> 


<aire> 


<ayer> 


<ayor> 


<eah> 


<e’er> 


<eir> 
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10% (4%) only word-final and only in 
(for(e)-)bear, pear, swear, tear 
(‘rip’), wear 


3% (60% !!) in total 


except in anaerobic, faerie, only initial and only in 
words where the morpheme air is followed by a vowel 
phoneme, namely several compounds of aero-, e.g. 
aerobic, aerodrome, aeroplane, aerosol, etc., plus aerate, 
aerial. \n all these cases the <r> is both part of <aer> 
spelling /ea/ and a grapheme in its own right spelling 
/r/. For dual-functioning see section 7.1. Compounds 
with the spelling <air>, e.g. aircraft, airmail, are more 
numerous, and therefore (because there are so few other 
words beginning /ea/, namely area, e’er, and heir and 
its derivatives) <air> is the main word-initial spelling 


only in aorist 
only in theirs 


only in bolero (/ba‘learau/, ‘dance’), concerto 
pronounced /kan'tfeatau/ (also pronounced 
/kan't{3:tau/), concierge, recherche, scherzo, sombrero. 
In bolero, sombrero the <r> is both part of <er> 
spelling /ea/ and a grapheme in its own right spelling 
/r/. For dual-functioning see section 7.1 


only in a few polysyllabic recent loanwords of mainly 
French origin, namely affaire, commissionaire, 
concessionaire, doctrinaire, laissez-faire, legionnaire, 
millionaire, questionnaire, secretaire, solitaire. 
/r/-linking occurs in millionairess - see section 3.6 


only in prayer pronounced /prea/ (‘religious formula’; 
contrast prayer pronounced /'preija/, ‘one who prays’) 


only in mayor and derivatives. /r/-linking occurs in 
mayoral, mayoress - see section 3.6 


only in yeah 


only in e’er, ne’er, where’er and a few other archaic 
contracted forms. See Section A.9 in Appendix A 


not counted (22% !!) only in their 
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<ere> not counted (37% !!) (with <er>) only in ere, there, 
(no)where and a few polysyllabic recent loanwords of 
French origin, namely ampere, brassiere, cafetiere, 
commere, compere, confrere, misere, premiere. 
/r/-linking occurs in thereupon, wherever, compering, 
etc. - see section 3.6 


<erre> only in parterre 
<ey’re> only in they’re. See Section A.9 in Appendix A 


<heir> — only in heir. There is /r/-linking in heiress, inherit - see 
section 3.6, and in inherit /h/ also surfaces - see section 
7.2 


2-phoneme (none) 
graphemes 


NOTES 


If we follow Crystal (2012: 131-2) and Upward and Davidson (2011: 176-9), 
‘more recent’ in terms of loanwords from French means after the Great Vowel 
Shift, which began about AD1400 and was complete by about AD1600. 

<are> is regular word-finally (and would be more so if there, where 
were spelt ‘thare, ‘whare - but that would destroy the parallelism with here), 
<air> initially, <ar> medially. 

Scarce, scarcity are the only words in which /ea/ spelt <ar> occurs 
before a consonant and the <r> is only part of the grapheme <ar> (hence 
more logical spellings for them would be “scairce, “scaircity, on the model 
of bairn, cairn, laird); in all other cases <ar> occurs before a vowel and 
the <r> is both part of <ar> spelling /ea/ and a grapheme in its own right 
spelling /r/ - for dual-functioning see section 7.1. 

Similarly, in all the patterns with word-final <r(e)> (which is every word- 
final pattern listed above except <eah>), there is potential /r/-linking (and 
dual-functioning) before a suffix beginning with a vowel phoneme (e.g. 
staring, repairing, wearing) or a following word beginning with a vowel 
phoneme - see section 3.6. Examples before a following word: prayer of 
intercession, mayor of Sheffield, ne’er a hope in h'll, misere ouverte, they’re 
arriving, heir apparent. 

Carney (1994: 110-1) points out that text frequencies for /ea/ differ 
vastly according to whether function words are included or not: ‘The 
three words where, there and their account for more than half the raw 
text frequency of /ea/’. In this book | have almost exclusively used his 
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function-words-excluded frequencies, but at the head of this entry and 
against a couple of the Oddities | have shown, for interest, both those and (in 
brackets) the very different frequencies when function words are included. 


5.6.4 /ta/ as in ear 


For why the percentages are double Carney’s see Notes. 
For all medial and final occurrences see also Table 5.5. 


THE MAIN SYSTEM 


Basic grapheme <ear> 56% (except in afeard, arrears, beard, 
bleary, weary and the half- 
exception ear) only word-final, e.g. 
appear, hear 


Other frequent <ere> 24% only word- final, e.g. interfere, 
graphemes (with mere, sincere. Carney’s (original) 
<er>) percentage excludes here, which 
would skew the figures (cf. /ea/, 
just above) 


<er> except in era, only medial, e.g. 
hero, series. \In all cases the <r> 
functions also as the spelling of /r/ 
— see Section 7.1 


<eer> 8% except in eerie (where <r> 
functions also as the spelling of /r/ 
- see section 7.1), only word-final, 
e.g. beer 


THE REST 


Oddities 12% in total 
<eir> only in weir, weird 


<eyr> only in eyrie (where <r> functions also as the 
spelling of /r/ - see section 7.1) 


<e’re> only in we’re. See Section A.9 in Appendix A 


<ier> never initial; medially, only in fierce, pierce, tierce; 
otherwise only final 
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<ir> only in emir, fakir (can be stressed on either syllable, 
and is also pronounced with /a/), kir, kirsch, nadir 
pronounced /'nerdt1a, nz'd1a/ (also pronounced 
/‘ne1da/), souvenir, tapir 


2-phoneme graphemes (none) 


NOTES 


Carney (1994: 190) posits two sources of this phoneme in RP: 

1) cases where there used to be a /r/ consonant following an /i:/ vowel. 
A letter <r> remains in the spelling (usefully for speakers with rhotic 
accents), but in RP the /r/ has disappeared except when /r/-linking 
occurs (see section 3.6); 

2) cases where there never was a /r/ phoneme but an /i:/ has combined 
with a following /2/. 

| accept the first category but not the second. Carney does say (ibid.) that 
the second category ‘for some speakers may still represent disyllabic /i:/ 
plus /a/’, and | think this is the case in my accent and that of many other 
RP speakers. For example, on Carney’s analysis the expression Stay, dear 
and the word stadia would both be analysed as pronounced /'sterd1a/, with 
two syllables, but | think only the former is so pronounced and that stadia 
is pronounced /'stetdi:ja/, with three syllables (and an automatic /j/-glide 
before the final schwa - see section 3.8.8). | have therefore assigned almost 
all occurrences with <r> to /1a/ (for the exceptions see below), and all 
occurrences without <r> instead to /izja/. 

Fortunately, unlike the situation with /1, i:/, Carney provides just 
enough information to re-calculate the percentages for /1a/ without the 
second category, and the results (which are double the percentages given 
by Carney) are shown above. 

<eer> may seem a more ‘basic’ spelling of /1a/ than <ear> but accounts 
for a much smaller percentage of its occurrences. 

Curiously, Carney does not list <ier> or any of the words containing it 
in his treatment of /1a/, presumably because none occurred in his corpus. 

The only words in which /1a/ occurs initially seem to be ear (where it is 
the whole word and therefore also final), eerie, era and eyrie. 

Spellings with <r> which | believe belong to /itja/ rather than to /1a/ are 
few in number, and restricted to: 

1) adjectives in <-ear, -iar>: cochlear, linear, nuclear, familiar, peculiar; 

2) comparative adjectives in <-ier>, e.g. easier, happier. 

In all these cases | believe the ending has two syllables. 
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A serendipitous outcome of my analysis of /1a/ is that all its occurrences 
in polysyllables are stressed, wherever in the word they occur, except that 
fakir, frontier, nadir can be stressed on either syllable and belvedere is 
often stressed on the first syllable. 

Medially, the predominant spelling is <er>, and in polysyllables there 
are no exceptions; finally, the predominant spelling is probably <eer> in 
lexical frequency but definitely <ear> in text frequency because most of the 
words in which it occurs have high frequency. However, because no clear 
guidance can be given on which spelling of /1a/ occurs in which words in 
final position, lists for both medial and final positions are given in Table 5.5. 

In all the patterns listed above which occur word-finally (which is all 
of them except <er, eyr>), there is potential /r/-linking before a suffix 
beginning with a vowel phoneme (e.g. hearing, sincerity, beery) or a 
following word beginning with a vowel phoneme - see section 3.6. Examples 
before a following word: hear and obey, beer and skittles, we’re off! 


TABLE 5.5: SPELLINGS OF /1a/ IN MEDIAL AND FINAL POSITIONS. 


medial final 


<ear> afeared, beard <ear> blear, clear, dear, drear, ear, fear, 
gear, hear, near, rear, sear, shear, smear, 
spear, tear (‘moisture from eye’), year, 
appear, arrear 


<eer> beer, cheer, deer, jeer, leer, peer, 
queer, seer, sheer, sneer, steer, veer, 
auctioneer, Brexiteer, career, charioteer, 
commandeer, domineer, electioneer, engineer, 
gazetteer, mountaineer, muleteer, musketeer, 
mutineer, pamphleteer, pioneer, privateer, 
profiteer, scrutineer, veneer, volunteer 


<eir> weird <eir> weir 

<er> adherent, cereal, coherence, <ere> mere, sere, sphere; ad/co/in-here, 
coherent, ethereal, funereal, hero, austere, belvedere, cashmere, interfere, 
inherent, managerial, material, revere, severe, sincere 


perseverance, serial, series, serious, 
serum, sidereal, venereal, zero, also 
frequent when words in <-ere> are 
suffixed, e.g, interfering 


<ier> fierce, pierce, tierce <ier> bier, pier, tier, bandolier, bombardier, 
brigadier, cashier, cavalier, chandelier, 
chevalier, clavier, corsetier, frontier, fusilier, 


gondolier, grenadier, halberdier, vizier 
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TABLE 5.5: SPELLINGS OF /1a/ IN MEDIAL AND FINAL POSITIONS, CONT. 


medial final 
<ir> kirsch <ir> fakir, kir, nadir, souvenir 
In all the words with <er> in this All the words in this column have the 


column the <r> is both a grapheme in potential for /r/-linking - see section 3.6, 
its own right spelling /r/ and part of some with change of vowel, e.g. sincerity. 
the grapheme <er> spelling /1a/. For 
dual-functioning see section 7.1. 


5.6.5 /vd/ as in rural 


This phoneme is so rare in RP that it would be futile to identify a basic 
grapheme, so | have just listed 1- and 2-phoneme graphemes. In all cases 


see Notes. 
1-phoneme <eur> only in pleurisy, where the <r> also spells 
graphemes /r/. For dual-functioning see section 7.1 


<oor> only word-final and only in boor, moor, 
poor, Spoor pronounced /bua, mua, pua, 
spua/ (also pronounced /b9:, mo:, pd:, 
sp2:/). There is /r/-linking in, e.g., boorish 
— see section 3.6 


<our> only in amour, bourbon (‘biscuit’), 
bourgeoisie), bourse, contour, detour, 
dour pronounced /dusa/ (also pronounced 
/‘dauwea/), entourage, gourd, gourmand, 
gourmet, houri, mourn (e.g. in mourning 
pronounced /'muantin/ to distinguish 
it carefully from morning /'m>:n1n/), 
potpourri if we take the second <r> as 
spelling /r/, tour, tourney pronounced 
/‘tuani:/ (also pronounced /'t3:niz/), 
tournament, tourniquet, troubadour, velour. 
There is /r/-linking in, e.g., touring - see 
section 3.6 - and in houri the <r> is both 
part of grapheme <our> and a grapheme 
in its own right spelling /r/. For dual- 
functioning see section 7.1 


2-phoneme 
graphemes 


<ur> 


<ure> 


/jua/ 
(1) 


spelt <eur> 


(2) 
spelt <ur> 
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never word-final; initially, only in urtext; 
otherwise only medial, e.g. injurious, 
insurance, juror, jury, luxuriance, 
luxuriant, luxuriate, luxurious (pronounced 
/lag'Z3uearitj-ans /ant/ert/as/), prurien-t/ 
ce, rural, usurious; also centurion, durable, 
(en)during, duress, maturity pronounced 
/sen't{uaritjan, 'd3uarabeal, (1n)'dguarin, 
dgua'res, ma'tfuariti:/, i.e. with /tj, dj/ 
affricated to /tf, dz/. In all cases except 
urtext the <r> is both part of <ur> spelling 
/ua/ and a grapheme in its own right 
spelling /r/ - for dual-functioning see 
section 7.1 


only word-final, e.g. abjure, adjure, assure, 
brochure, caricature (also pronounced with 
final /a/), conjure /kan'djua/ (‘summon 
with an oath’), cynosure, embouchure, 
endure pronounced /in'dgjua/, ensure, 
insure, mature pronounced /ma't{ua/, 
overture (if final syllable is pronounced 
/tfua/ rather than /tfa/), sure 


only in Europe (where the <r> is alsoa 
grapheme in its own right spelling /r/ - for 
dual-functioning see section 7.1) and 
liqueur pronounced /I1'kjua/ 


never word-final; initially, only in urea and 
various words derived from or cognate 

with it, e.g. Uranus pronounced /'juaranas/ 
‘urine us’ (also pronounced /ja'retnas/ ‘your 
anus’), urethra, uric, urine, urology; medial 
examples are bravura, curate (both the 
noun ‘junior cleric’ pronounced /'kjuarat/ 
and the verb ‘mount an exhibition’ 
pronounced /kjua'reit/), curie, curious, 
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furious, fury, mural, purify, purity, security, 
spurious; also centurion, durable, (en) 
during, duress, maturity pronounced 
/sen'tjuari:jan, ‘'djuarabal, (1n)'djuarin, 
djua'res, ma'tjuariti:/, i.e. with /tj, dj/ NOT 
affricated to /tf, d3/. In all cases the <r> 

is both part of <ur> spelling /jua/ anda 
grapheme in its own right spelling /r/ - for 
dual-functioning see section 7.1 


(3) only word-final, e.g. allure, coiffure, cure, 

spelt <ure> demure, endure pronounced /in'djua/, 
immure, inure, lure, manure, mature, 
ordure pronounced /ma'tjua, d:'djua/, 
overture (if final syllable is pronounced 
/tjua/ rather than /tja/), photogravure, 
pure, secure, sinecure, Ure; also in azure 
pronounced /'zzjua, 'e1zjua/ (also 
pronounced /'wzja, 'e1zja, '39, 'e139/) 


Oddities All the correspondences for this phoneme are Oddities 


NOTES 


This phoneme is rare and getting rarer in RP, and may eventually disappear. 
Its rarity means percentages for graphemes would be misleading, as would 
treating /jua/ as a separate phoneme from /ua/ in parallel fashion to 
separating /ju:/ from /ur/. Many words in which /ua/ used to occur now 
have />:/ instead. For instance, the word your used to be pronounced /jua/ 
(and still is, in some accents), but in RP is now /jo:/ - and cure, liqueur, 
mature and pure are now often heard as /kjo:, It'kjo1, ma't{>:1, pjar/ in 
up-market accents. But words like curious, fury, injurious, juror, jury, 
prurient, rural, spurious seem to be resisting the change to /d:/. Check 
your own pronunciation of the words listed in this section. 

Carney (1994: 194-5) also classifies as examples of /(j)ua/ many words 
in which letter <u> is followed by a spelling of /a/ (e.g. cruel, jewel, usual). | 
analyse these instead as being pronounced with /(j)u:/ and /a/ constituting 
a separate syllable (and an automatic intervening /w/-glide). It is noticeable 
that all these words end in a consonant phoneme. 
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In phonologically similar words which end in a vowel phoneme (which 


here is always /a/) it seems to be agreed that the ending is /'(j)u:wa/. Words 


in these (very small) groups are: 


(with /'urwa/) brewer, sewer (/'su:wa/, ‘foul drain’, as opposed to 


Sewer /'sauwa/, ‘one who sews’), interviewer, viewer (in the last two 


the /j/ glide is spelt <i> - contrast the next group); 


(with /‘ju:wa/) ewer, fewer, hewer, newer, renewer, skewer (cf. the 


homophone skua). A few derived forms also have /'u:wa/, e.g. doer 


(‘one who does’), two-er (‘conker which has broken two others’). 


In my analysis a few /(j)ua/ v. /‘(j)u:wa/ minimal pairs seem possible, e.g. 


Ure/ewer, dour/doer, tour/two-er - but the phonological difference is 


minute (and some phoneticians would say non-existent). 


5.7 Letter-name vowels: /el i: al aU ju:/, 


plus /u:/ 


5.7.1 /el/ as in aim 


THE MAIN SYSTEM 


For all these categories see Notes. 
Basic grapheme <a.e> 38%(76%in 


monosyllables) 


Other frequent <a> 27% 
graphemes 


<ay> 18% 


<ai> 12% 


regular in closed final 
syllables, e.g. dilate, make, 
take, ache, champagne 


regular in non-final syllables 
of stem words, e.g. agent, 
bacon, labour 


regular in open final 
syllables (= in word-final 
position), e.g. chardonnay, 
day, display, way; never 
initial; rare medially but cf. 
always, claymore, mayhem, 
nowadays 


regular before /nt/, e.g. paint 
(only exceptions: ain’t (sort 
of), feint); never word-final 
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THE REST 


Oddities 5% in total 


<ae> only in brae, Gaelic pronounced /'getl1k/, maelstrom, 
reggae, sundae 


<ah> only in dahlia 
<aigh> only in straight 
<ais> only in palais 


<ait> only in distrait, parfait, and trait pronounced /tre1/ 
(also pronounced /trert/) 


<alf> only in halfpence, halfpenny 

<ao> only in gaol 

<au> only in gauge 

<aye> only in aye (‘ever’) 

<e> only in 60+ more recent loanwords mainly from French 


where French spelling has <é>, namely (in non-final 
position) debris, debut, decor, eclair, ecru, elan, ingenu, 
precis; first <e> in debacle, debutante, decalage, 
decolletage, denouement, detente, elite, ingenue, 
menage, regime, seance, (Greek) heter/hom-ogeneity 
pronounced /hetar/hom-audzr'nerjiti:/ (usually 
pronounced /hetar/hom-audsr'nizjiti:/), (Old English) 
thegn and (Hawaian) ukulele; (word-finally) abbe, 
attache, blase, cafe (also pronounced with /i:/), canape 
(also pronounced with /i:/, hence the invitation | once 
received to a party with ‘wine and canopies’), cliche, 
communique, conge, consomme, coupe, diamante, 
fiance, flambe, frappe, glace, habitue, macrame 
(derived from Turkish), manque, outre, retrousse, 
risque, rose (‘pink wine’), roue, saute, soigne, souffle, 
touche, (Amerindian/Spanish) abalone, (Italian) biennale, 
finale, latte, (Greek) agape (‘love feast’), (Spanish/ 
Nahuatl) guacamole, Japanese) anime, kamikaze and 
(Mexican Spanish) tamale; final <e> in (French) emigre, 
expose, naivete, protege, recherche, resume (‘c.v.’), 
retrousse (KiSwahili/Spanish), dengue and (Turkish) 
meze. 


<ea> 


<ee> 


<e.e> 


<ei> 


<eigh> 


<er> 


<es> 


<et> 
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There is an increasing tendency to spell the French 
loanwords in this list, within English text, with <é>, 
thus signalling their status as not-yet-fully—-assimilated 
loanwords (and my spell-checker keeps inserting <é> 
where | don’t want it to) - but it could also be argued 
that this is yet another spelling complexity for native 
English speakers to cope with, especially since the 
Compact Oxford Dictionary recognises such forms 

as flambés, flambéing, flambéed, which on the other 
hand suggests that <é> is becoming a grapheme of 
English - if it does, where would it fit in the alphabet 
and therefore dictionaries? 


only in break, great, steak, yea, Yeats 


only word-final and only in about 13 more recent 
loanwords where French spelling has <ée>, namely 
corvee, dragee (‘sugar-coated sweet’) pronounced 
/‘draxzer1/ (also pronounced /'dretdzi:/), entree, epee, 
fiancee, levee (‘reception or assembly’, also pronounced 
with /i:/), matinee, melee, nee, negligee, puree, soiree, 
toupee. The tendency to use <é> is growing here too 


only in crepe, fete, renege, suede, Therese /kretp, fett, 
ri'neig, sweid, ta'rerz’/ 


only in about 15 words, namely abseil, apartheid, beige, 
deign, feign, feint, heinous pronounced /'hetnas/ (also 
pronounced /'hi:nas/), lei (only example in an open 
syllable), obeisance, reign, rein, reindeer, seine, sheikh, 
Skein, surveillance, veil, vein 


only in eight, freight, heigh, inveigh, neigh, neighbour, 
sleigh, weigh, weight 


only word-final and only in a few more recent 
French loanwords, namely atelier, croupier, dossier 
pronounced /'dpsi:jer/ (also pronounced /'dpsi:ja/), 
foyer pronounced/'fwazjet, 'foijer/ (also pronounced 
/'fotja/), metier, rentier 


only in demesne 


only word-final and only in about 20 more recent 
French loanwords, namely ballet, beret, bidet, bouquet, 
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buffet (‘food’), cabaret, cabriolet, cachet, cassoulet, 
chalet, crochet, croquet, duvet, gilet, gourmet, parquet, 
piquet, ricochet, sachet, so(u)briquet, sorbet, tourniquet, 
valet pronounced /'vele1/ (also pronounced /'vzlit/). 
/t/ surfaces (see section 7.2), always with change 

of preceding vowel, in balletic with /e/, parquetry, 
valeting with /1/ 


<ey> never initial; medially, only in abeyance, heyday, word- 
finally, only in bey, convey, fey, grey, hey, lamprey, 
obey, osprey, prey, purvey, survey, they, whey 


<ez> only in laissez-faire, pince-nez, rendezvous 
2-phoneme (none) 
graphemes 
NOTES 


If we follow Crystal (2012: 131-2) and Upward and Davidson (2011: 176-9), 
‘more recent’ in terms of loanwords from French means after the Great Vowel 
Shift, which began about AD1400 and was complete by about AD1600. 
<a> is regular in non-final syllables of stem words - see section 6.3. 
Exceptions (in addition to derived forms, e.g. daily, gaily, playing, and those 
listed among the Oddities above): aileron, attainder, caitiff, complaisant, 
dainty, daisy, gaiter, liaison, maintain, plaintiff, plaintive, raillery, raisin, 
traitor, wainscot, all with <ai>. 
<a.e> is regular in closed final syllables, including not only the large 

number of mono- and polysyllables with a single final consonant phoneme 
spelt with a single letter, but also: 

five words with two consonant letters forming a digraph representing 

a single consonant phoneme separating <a.e>: ache, champagne, 

bathe, lathe, swathe 

the small group of words ending in /etndg/ spelt <-ange>: arrange, 

change, grange, mange, range, (e)strange, with two consonant 

phonemes separating <a.e> (no exceptions) 

the small group of words ending in /erst/ spelt <-aste>: baste, chaste, 

haste, lambaste (which has the variant form lambast, with /z/), paste, 

taste, waste, still with two consonant phonemes separating <a.e> 

(exceptions: only waist as a stem word, but confusion is possible with 

a number of past-tense verbs, namely based, chased, paced). 
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The only monosyllable in which /e1/ is spelt just <a> without another vowel 
letter (and irregularly before a doubled spelling) is bass /be1s/ ‘(player of) 
large stringed instrument’ / ‘(singer with) low-pitched voice’. 

<ay> is regular in word-final position in both mono- and polysyllabic 
stem words (for exceptions see the Oddities above), and rare elsewhere in 
stem words: the only medial examples seem to be claymore, mayhem, and 
there are none in initial position. However, medial <ay> spelling /er/ does 
also occur in compound words, e.g. always, hayfever, maybe, playground, 
and frequently before suffixes, e.g. playing. 

Fortunately, there are no occurrences of word-final /e1/ spelt <ai>, thus 
reducing the possibility of confusion with <ay>, but all words with /e1/ 
spelt <ai> are still exceptions, either to the prevalence of <a> in non-final 
syllables or to the prevalence of <a.e> in closed final syllables. 

The only useful sub-rule is that <ai> is regular before /nt/, e.g. (in 
monosyllables) faint, paint, plaint, quaint, saint, spraint, taint (only 
exceptions: ain’t (sort of), feint); (in polysyllables) acquaint, attaint, 
complaint, con/di/re-straint, plus Aintree, dainty, maintain, maintenance, 
plaintiff, plaintive (the last six words being apparently the only examples 
before /nt/ in non-final syllables of stem words), but it is not predictable by 
rule elsewhere. The rule that <ai> is regular before /nt/ is one of only two 
cases where the spelling of a rime/phonogram is more predictable as a unit 
than from the separate phonemes and there are enough instances to make 
the rule worth teaching - see section A.7 in Appendix A. 

About half (by text frequency) of /erl, e1n/ spellings in closed 
monosyllables have <-ail, -ain>, e.g. ail, bail, fail, flail, frail, grail, mail, hail, 
jail, pail, quail, rail, sail, snail, tail, trail, wail (also cf. Braille); brain, chain, 
drain, fain, gain, grain, main, pain, plain, rain, sprain, stain, strain, swain, 
train, twain, vain, wain and the irregular past participles lain, slain (see 
third paragraph below), but this means that these groups have maximum 
confusability with words in <-ale, -ane>, etc., e.g. (and listing just a few 
that are homophones of words in <-ail, -ain>) ale, bale, male, hale, pale, 
sale, tale, whale; gaol, fane, lane, mane, pane, plane, vane, wane; feign, 
reign, rein, vein, and all these words have to be learnt individually. 

This is also true of: 

1) polysyllables with <ail, ain(e)>: (in non-final syllables) aileron, daily, 
gaily, raillery, attainder, wainscot; (in final syllables) assail, avail, 
curtail, detail, entail, entrails, prevail, retail, travail, wassail; ascertain, 
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chilblain, cocaine, contain, disdain, domain, entertain, explain, migraine, 
moraine, obtain, pertain, plantain, quatrain, remain, terrain 
2) the ragbag of other words (mono- and polysyllables) with <ai>, where 
there is also considerable potential for confusion: (in non-final syllables) 
caitiff, complaisant, daisy, gaiter, lackadaisical, liaison, raisin, traitor, (in 
final syllables) afraid, aid, aide, aim, aitch, arraign, bait (contrast bate), 
baize (contrast bays), braid (contrast brayed), braise (contrast brays), 
campaign (contrast champagne), (de/ex/pro-)claim, cockaigne, faith, gait 
(contrast gate), liaise, maid (contrast made), maim, maize (contrast maze), 
malaise, mayonnaise, plaice (contrast place), praise, raid, raise (contrast 
raze), staid (contrast stayed), staithe, strait (contrast straight), traipse, 
waif, waist (contrast waste), wait (contrast weight), waive (contrast wave), 
wraith and the irregularly-spelt (but not irregularly-pronounced) past 
tenses and participles laid, paid (see next paragraph). 
As pointed out under /d/, section 3.5.2, the spellings laid, paid are 
irregular; the regular spellings would be “layed, “payed. But for the irregular 
past participles of /ie (‘be horizontal’), slay the spellings lain, slain seem 
preferable to ‘layn, “layen, “slayn, “slayen, and can perhaps be counted as 
extensions of the general <y>-replacement rule - see section 6.5. 
There seem to be no words ending in /e1/ spelt <er> taking suffixes 
beginning with vowel letters, and therefore no /r/-linking (section 3.6). If 
so, this is the only such category. 


5.7.2 /i:/ as in eel 


For the absence of percentages see Notes. 


THE MAIN SYSTEM 


For all these categories see Notes. 


Basic grapheme <ee> e.g. ee! (virtually the only occurrence in initial 
position), beech, bee, see 


Other frequent <e> e.g. ether, lever, be. Regular in non- 
graphemes final syllables of stem words. In closed 
monosyllables, apparently only in retch 


<ea> e.g. each, beach, sea 


<i> e.g. chic (only example in a closed stem 
monosyllable), alien, litre, ouija, safari 


Rare graphemes 


THE REST 


Oddities 


<y> 


<e.e> 


<ie> 


<ae> 
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almost entirely word-final, where it is the 
regular spelling in polysyllables, e.g. city, 
plus rare medial examples, e.g. caryatid, 
embryo, halcyon, polysyllable 


rare in closed monosyllables; regular in 
closed final syllables of polysyllabic words, 
e.g. complete, discrete, grapheme, phoneme 


never initial. In non-final syllables only in 
chieftain, diesel. Otherwise only in final 
syllables: (closed monosyllables) brief, 

chief, fief, field, fiend, frieze, grief, grieve, 
lief, liege, mien, niece, piece, priest, shield, 
shriek, siege, thief, thieve, wield, yield; 
(open monosyllable) brie (only); (closed final 
syllables of polysyllables) achieve, aggrieve, 
Aries, belief, believe, besiege, hygiene, relief, 
relieve, reprieve, retrieve, series, serried, 
species; (open final syllables of polysyllables) 
aerie, anomie, auntie, Aussie, birdie, 

bogie, bolshie, bonhomie, boogie, bookie, 
bourgeoisie, bowie, brassie, budgie, caddie, 
calorie, camaraderie, chappie, collie, commie, 
conscie, cookie, coolie, coterie, cowrie, 

curie, darkie, dearie, eerie, eyrie, gaucherie, 
genie, g(h)illie, girlie, goalie, hoodie, laddie, 
lassie, lingerie pronounced /‘lengzari:/ (also 
pronounced /'londgare1/), luvvie, menagerie, 
movie, nightie, organdie, pixie, prairie, 
reverie, rookie, quickie, specie, stymie, 
sweetie, talkie, zombie. For ‘<i> before <e> 
except after <c>’ see section 6.1 


only in (in non-final syllables) aegis, aeon, aesthet-e/ 
ic, anaemi-a/c and other words ending /‘i:mi:ja, 'izmrk/ 
spelt <-aemi-a/c>, anaesthetist, archaeolog-ical/ist/y, 
Caesar(ian), caesura, encyclopaedia, faeces, 
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haemoglobin, Linnaean, Manichaean, mediaeval, naevus, 
paean, palaeolithic, praetor, quaestor. Many of these 
words have alternative spellings in <e>, especially in US 
spelling; (in final syllables - always open; no examples 
in closed final syllables) algae, alumnae, antennae, 
formulae, larvae, personae, pupae, vertebrae 


<ay> only finally and only in quay and compounds of day 
(birthday, holiday, Sunday, yesterday, etc.), except 
heyday, midday, nowadays, today, workaday, which 
have /er/, as does holidaying 


<ei> only medial and only in: (in non-final syllables) ceiling, 
cuneiform, disseisin, (n)either pronounced /'(n)i:da/ 
(also pronounced /'(n)airda/), heinous pronounced 
/'hitnas/ (also pronounced /‘hei:nas/), inveigle, 
plebeian, (in final syllables) caffeine, casein, codeine, 
conceit, conceive, counterfeit (also pronounced with 
/fit/), deceit, deceive, perceive, protein, receipt, 
receive, seize. For ‘<i> before <e> except after <c>’ 
see section 6.1 


<eo> only in feoffee, feoffment, people 


<ey> except in geyser pronounced /'gi:za/, only final 
and only in abbey, alley, attorney, baloney, barley, 
blarney, blimey, cagey, chimney, chutney, cockney, 
comfrey, coney, donkey, dopey, flunkey, fogey, galley, 
gooey, hackney, hockey, homey, honey, jersey, jockey, 
journey, key, kidney, lackey, malarkey, matey, medley, 
money, monkey, motley, nosey, palfrey, parley, 
parsley, pokey, pulley, storey, tourney, turkey, valley, 
volley 


<i.e> only in closed final syllables, but in at least 70 words - 
see Table 5.6 and the note below it 


<is> only finally and only in chassis, commis (chef), coulis, 
debris, precis, verdigris pronounced /'v3:digri:/ (also 
pronounced /'v3:digri:s/), vis-a-vis (last syllable) 


<it> only finally and only in esprit, petit mal, wagon-lit 


<oe> only in non-final syllables and only in amenorrhoea, 
amoeba, apnoea, coelacanth, coelenterate, coeliac, 
coelom, coenobite, coenocyte, diarrhoea, dyspnoea, 
foetal, foetid, foetus, gonorrhoea, lactorrhoea, 
logorrhoea, oedema, oenology, oesophagus, oestrogen, 
oestrus, phoenix, pyorrhoea, subpoena, plus 
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onomatopoeia, pharmacopoeia if <ia> is taken as 
spelling /i:ja/. Many of these words have alternative 
spellings in <e>, especially in US spelling 


<ois> only in chamois (the leather, pronounced /'fzmi:/ 
(also spelt shammy), as opposed to the animal from 
whose skin it is made, pronounced /'famwa:/) 


2-phoneme graphemes (none) 


NOTES 


The reason for the absence of percentages here is my re-allocation of 
word-final <y> to /i:/ rather than /1/ (see section 5.4.3), and of many 
of Carney’s /1a/ words to /i:ja/ (see section 5.6.4). Carney doesn’t give 
enough information on either set of words to calculate the effect of these 
re-allocations on the percentages for /i:/. 

Unlike most of the split digraphs, <e.e> is not very frequent - in Carney’s 
analysis (excluding final /i:/ spelt <y>) it accounts for only 3% of spellings 
of /i:/, and for only 27% even in monosyllables, and percentages counting 
in final /i:/ spelt <y> would be even lower. It is the second rarest of the 
split digraphs, the rarest being <y.e>. 

The regular spellings of /ix/ are: 

in open and closed monosyllables: <ee> 

in open final syllables (= stem-finally) in polysyllables: <y> 

in closed final syllables of polysyllables: <e.e> 

in non-final syllables, especially of stem words: <e>, but there are 

large numbers of exceptions with <i>. 
In open monosyllables the regular spelling is <ee>, as in bee, fee, flee, free, 
gee, ghee, glee, knee, lee, pee, scree, see, spree, tee, thee, three, tree, twee, 
wee. Exceptions: be, he, me, she, the (when stressed), we, ye - but these 
are all function words, which don’t have to obey the Three-Letter Rule (see 
section 4.3.2); flea, lea, pea, plea, sea, tea; key, ski; brie. 

In closed monosyllables Carney (1994: 157-8) lists 108 words with 
<ea>, 87 with <ee>, and 30 with minority spellings. However, those with 
<ee> seem more frequent, e.g. beech, cheek, cheese, deep, feed, feek, feet, 
geese, green, keep, meet, need, seem, seen, sleep, sleeve, sneeze, speech, 
speed, street, week, wheel. Also, analysing <ea> as the regular spelling here 
would seem odd, given that <ea> has several other correspondences, some 
of them with long lists of words, while <ee> has hardly any and they are 
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all rare. | therefore take <ee> to be most regular spelling of /i:/ in closed 
monosyllables. For exceptions see Table 5.6, plus chic, retch, seize. 
In open final syllables of polysyllables the regular spelling (in my analysis, 
as against Carney’s) is <y>, e.g. city, pretty. Exceptions include: 
those listed under the rare grapheme <ie> and the Oddities <ae, ay, 
ey, is, it, ois> above 
aborigine, acme, acne, adobe, anemone, apostrophe, bocce, 
catastrophe, coyote, dilettante, epitome, extempore, facsimile, (bona) 
fide, hebe, hyperbole, Lethe, machete, menarche, minke, nepenthe, 
oche, posse, psyche, recipe, reveille, sesame, simile, stele, strophe, 
tagliatelle, tsetse, ukulele, vigilante 
a few words in <-e> where pronunciation of the final phoneme varies 
between /i:/ and /et/: abalone, cafe, canape, finale, forte, furore, 
guacamole, kamikaze, karate 
one word in <-ea>: guinea 
all the words ending in <-ee> indicating ‘person to whom something 
is done’ (all with final stress), e.g. addressee, amputee, appointee, 
assignee, conferee, debauchee, dedicatee, deportee, divorcee, draftee, 
employee, enrollee, examinee, grantee, inductee, internee, interviewee, 
invitee, legatee, lessee, licensee, mortgagee, nominee, parolee, 
patentee, payee, referee, trainee, transferee, trustee, vestee 
a ragbag of other words ending in <-ee>, including: (with initial 
stress) apogee, coffee, dragee (‘sugar-coated sweet’) pronounced 
/‘dretdgi:/ (also pronounced /'dra:ze1/), filigree, fricassee, gee-gee, 
jubilee, kedgeree, levee (‘reception or assembly’, also pronounced 
with /e1/), lychee, manatee, pedigree, perigee, Pharisee, prithee, 
puttee, Sadducee, spondee, squeegee, standee, suttee, thuggee, 
toffee, trochee, yankee; (with medial stress) committee; (with final 
stress) absentee, agree, attendee, banshee, bargee, bootee, buckshee, 
chickadee, chimpanzee, decree, degree, devotee, dungaree, escapee, 
goatee, grandee, guarantee, jamboree, marquee, refugee, repartee, 
rupee, settee, truckee 
a further ragbag of mostly foreign words ending in <i>: anti, bikini, 
broccoli, chilli, confetti, deli, ennui, graffiti, khaki, kiwi, literati, 
macaroni, maxi, midi, mini, muesli, mufti, nazi, potpourri, safari, 
salami, scampi, spaghetti, sari, semi, shufti, stimuli, taxi, tsunami, 
umami, vermicelli, wiki, yeti, yogi. 
In closed final syllables of polysyllables the regular spelling is <e.e>. For 
exceptions in <ei> see the Oddities, and for those in <ea, ee, ie, i.e> see 
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Table 5.6. There are apparently just five exceptions with <i>: ambergris, 
aperitif, batik, massif, motif, and one with <e>: harem. There is also a small 
group with /i:z/ spelt <-es>, namely some plural nouns of Greek origin with 
the singular ending /1s/ spelt <-is>, e.g. analyses (/a'nelisi:z/, the singular 
verb of the same spelling being pronounced /'enalaiziz/), apotheoses, axes, 
bases (/'eksi:z, 'betsi:z /, plurals of axis, basis; axes, bases as the plurals 
of axe, base are pronounced (regularly) /‘eks1z, 'be1s1z/), crises, diagnoses, 
emphases, exegeses, nemeses, oases, periphrases, synopses, theses and all 
its derivatives, plus (Greek singulars) diabetes (also pronounced with final 
/1s/), herpes, litotes, pyrites, (a stray Greek plural with singular in <-s>) 
Cyclopes, and (Latin plurals) amanuenses, appendices, cicatrices, faeces, 
interstices, mores, Pisces, testes. 

Avery odd word thatis relevant here is dioceses. Inits singular form diocese, 
pronounced /'datjasis/, each phoneme (except the automatic /j/-glide) 
can be related to a grapheme, provided the final /s/ is analysed as spelt 
<se>. But the plural has the two pronunciations /'datjasi:z1z, ‘datjasi:z/. 
In the former, again each phoneme (except the /j/-glide) can be related 
straightforwardly to a grapheme, provided we accept that the first <e> spells 
/it/, both <s>’s spell /z/ (the first being voiced despite being voiceless in 
the singular - cf. the other words of Greek origin just listed), and the second 
<e> spells /1/. But in the second pronunciation it seems as though /i:z/ is 
spelt <-eses> and | am at a loss to know which letters to relate the two 
phonemes to - perhaps more rational spellings would be “diosis (singular), 
“dioses (plural), which would bring both forms into line with those listed 
above, and with all other words with final /s1s/, which are all spelt <-sis>, 
despite neither “diosis nor “dioses having a genuine Greek etymology. 

The five major possibilities in closed final syllables of polysyllables and 
in closed monosyllables are shown in Table 5.6. There appear to be no 
useful rules suggesting when spellings other than <e.e> in closed final 
syllables of polysyllables and <ee> in closed monosyllables occur - all the 
other words just have to be learnt. 

In non-final syllables of stem words the letter-name spelling <e> (see 
section 6.3) predominates, especially in word-initial position, where the 
only exceptions appear to be aegis, aeon, aesthete, eager, eagle, easel, 
Easter, easy, either pronounced /'i:éa/, oedema, oenology, oesophagus, 
oestrogen, oestrus. In medial syllables <e> still predominates, e.g. 
beauteous, completion, European, Jacobean, lever, phonemic, simultaneous, 
Spontaneous and thousands of others. 
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TABLE 5.6: <ea, ee, e.e, ie, i.e> AS SPELLINGS OF /i:/ IN CLOSED FINAL SYLLABLES. 


Regular 
spelling 


In polysyllables: <e.e> 


In monosyllables: <ee> on the basis 
of the argument above 


and in the paragraphs above this Table) 


Exceptions (in addition to those listed under the Oddities <ei, i, ie> 


<ea> 


impeach, 


peace; beach, bleach, breach, each, 
leach, peach, pleach, preach, reach, 
teach 


bead, knead, lead (verb), mead, plead, 
read (present tense) 


leaf/ves, sheaf/ves 


league 


beak, bleak, creak, freak, leak, peak, 
sneak, speak, squeak, streak, teak, 
tweak, weak, wreak 


anneal, appeal, conceal, congeal, repeal, 


deal, heal, leal, meal, peal, seal, squeal, 


reveal, steal, teal, veal, weal, zeal 
beam, bream, cream, dream, gleam, 
ream, scream, seam, steam, stream, 
team 

demean,; bean, clean, dean, glean, jeans, lean, 


mean, quean, wean 


cheap, heap, leap, neap, reap 


decease, decrease, increase, release; 


cease, crease, grease, lease 


appease, disease; 


ease, pease, please, tease 


leash 


beast, east, feast, least, yeast 


defeat, entreat, escheat, repeat, retreat; 


beat, bleat, cheat, cleat, eat, feat, heat, 
leat, meat, neat, peat, pleat, seat, teat, 
treat, wheat 


heath, sheath, wreath 


breathe, sheathe, wreathe 


bereave 


cleave, eave, greave, heave, leave, 
Sheave, weave 
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<ee> exceed, proceed, succeed; genteel; (regular, e.g. beef, deep, feed, green, 
esteem, redeem, boreen, canteen, seem, week) 
careen, colleen, dasheen, lateen, 
nanteen, sateen, tureen; discreet 
<e.e> (regular, e.g. complete, discrete, breve, cede, eke, eve, gene, glebe, 
grapheme, phoneme) grebe, meme, mete, Pete, rheme, scene, 
scheme, Steve, swede, Swede, theme, 
these 
<ie> achieve, aggrieve, belief, believe, besiege,| brief, fief, field, fiend, frieze, grief, 
hygiene, relief, relieve, reprieve, retrieve,| grieve, lief, liege, mien, niece, piece, 
series, species priest, shield, shriek, siege, thief/ves, 
thieve, wield, yield 
<i.e> caprice, police; pastiche; prestige; fiche, niche, quiche; clique, pique; 


fatigue, intrigue; automobile, imbecile; 
chenille; regime; benzine, brigantine, 
brilliantine, chlorine, cuisine, dentine, 
figurine, gabardine, guillotine, iodine, 
latrine, limousine, machine, magazine, 
margarine, marine, mezzanine, 
morphine, nicotine, opaline, phosphine, 
quarantine, quinine, ravine, routine, 
sardine, strychnine, submarine, 

tagine, tambourine, tangerine, terrine, 
trampoline, tyrosine, vaccine, wolverine; 
antique, boutique, critique, mystique, 
oblique, physique, technique, unique; 
cerise, chemise, expertise, valise; 
odalisque; pelisse; artiste, dirigiste, 
modiste; elite, marguerite, petite; naive, 
Khedive, recitative 


bisque; suite 


However, there are also at least a thousand exceptions - see under the 


Oddities <ae, ei, ey, oe> above, plus: 


with <i>: 


1) before <a> spelling /a/ with automatic intervening /j/-glide (Carney 


would place these words under /1a/): ammonia, anaemia, bacteria, 


begonia, camellia, chlamydia, (en)cyclopaedia, hernia, hysteria, 


media, myopia, salvia, sepia, utopia, amiable, dutiable, enviable, 


variable; congenial, jovial, managerial, material, memorial, radial, 


remedial, serial and about 450 others ending in <-ial>; barbarian, 


comedian, grammarian, guardian, pedestrian, ruffian, thespian 


and about 200 others ending in <-ian>; dalliance, luxuriance, 
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radiance, variance, suppliant, radiant, suppliant, variant, alien, 
audience, convenience, ebullience, experience, obedience, prurience, 
salience; expediency, leniency; convenient, ebullient, expedient, 
lenient, obedient, prescient, prurient, salient, sentient, subservient, 
transient; soviet; twentieth, etc.; period, sociological, axiom, 
accordion, bastion, battalion, bullion, carrion, centurion, clarion, 
collodion, ganglion, medallion, mullion, scorpion, scullion, stallion, 
chariot, patriot; commodious, compendious, curious, dubious, 
felonious, melodious, odious, previous, scabious, serious, studious, 
tedious and about 100 others ending in <-ious>; atrium, bacterium, 
compendium, gymnasium, medium, opium, potassium, radium, 
stadium, tedium and about 200 others ending in <-ium>; genius, 
radius; also second <i> in amphibious, bilious, billion, brilliancy, 
brilliant, criteri-a/on, delirium, editorial, fastidious, hilarious, 
historian, histrionic, idiom, idiot, industrial, juvenilia, memorabilia, 
millennia, oblivion, omniscience, omniscient, perfidious, perihelion, 
reptilian, resilience, resilient, trivia, vitriol, third <i> in incipient, 
initiate (noun), insidious, insignia, invidious, militaria; 

2) before other vowel phonemes with automatic intervening /j/-glide: 
ap/de-preciate, associate (verb), audio, calumniate, caviar, foliage, 
luxuriate, medi(a)eval, negotiate, orient (verb), oubliette, patio, 
radio, ratio, serviette, studio, trio, verbiage, viola (/vix'jaula/ 
‘musical instrument’); also first <i> in conscientious, liais-e/on, 
orgiastic, partiality, psychiatric, speciality, second <i> in histrionic, 
inebriation, insomniac, officiate, superficiality, vitriolic, third <i> 
in initiate (verb) 

3) before a consonant phoneme other than /j/: albino, ballerina, 
cappuccino, casino, cliché, concertina, farina, frisson, kilo, Libra, 
lido, litre, maraschino, merino, mosquito, ocarina, pinochle, 
piquant, scarlatina, semolina, visa; also first <i> in kiwi, martini, 
migraine, milieu, second <i> in bikini, incognito, libido, 

with other main-system graphemes: beacon, beadle, beagle, beaker, 

beaver, creature, deacon, feature, heathen, meagre, measles, queasy, 

reason, season, sleazy, squeamish, teasle, treacle, treason, weasel; 
beetle, cheetah, feeble, freesia, gee-gee, geezer, needle, squeegee, 

Sweetie, teeter, wheedle; chieftain and other compounds of chief.-, 

diesel, caryatid, embryo, halcyon, polyandry, polysyllable, polytechnic 

and many others with poly-. 


5.7.3 /al/ as in ice 


THE MAIN SYSTEM 


The phoneme-grapheme correspondences, 2: Vowels 


For all these categories see Notes. 


Basic grapheme_ <i.e> 
Other frequent <i> 
graphemes 
<y> 
<igh> 
THE REST 
Oddities 
<a> 
<ae> 


40% (70% in 
monosyllables) 


42% (with <ie 
(see Oddities), 
y>) 


13% 


5% in total 


205 


regular in closed final 
syllables (except in 
monosyllables before /t/ 
and consonant clusters; 
only other exception: 
mic /matk/, short for 
microphone), e.g. bike, 
sublime 


regular in non-final 
syllables, e.g. item, word- 
finally in polysyllables, e.g. 
alkali, and in monosyllables 
before consonant clusters, 
e.g. child 


e.g. beautify, by, cycle, 
psyche, sky, regular 
word-finally 


only in about 26 stem 
words (see section 10.25). 
Regular in monosyllables 
before /t/, e.g. sight. In 
non-final syllables, only in 
blighty, righteous, sprightly. 
Word-finally, only in high, 
nigh, sigh, thigh 


only in majolica pronounced /mar'joltka/ (also 


pronounced /ma'dgpl1ka/), naif, naive, papaya 


only in maestro, minutiae 
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<ai> 


<ais> 
<aye> 


<ei> 


<eigh> 


<ey> 


<eye> 
<ia> 


<ie> 


<ir> 


<is> 
<oy> 
<ui> 
<ye> 


<y.e> 


only in ailuro-phile/phobe, assegai, balalaika, banzai, 
bonsai, caravanserai, Kaiser, naiad, samurai, shanghai 


only in aisle. See Notes 
only in aye (‘yes’), aye-aye 


only in deictic, deixis, eider(down), eidetic, eirenic, 
either, Fahrenheit, feisty, gneiss, heist, kaleidoscope, 
meiosis, neither, poltergeist, seismic, stein 


only in height, sleight 


only in geyser pronounced /'gatza/ (usually 
pronounced /'gi:za/) 


only in eye 
only in diamond 


only word-final, e.g. pie (see Notes), except for 
suffixed forms after <y>-replacement (see section 6.5), 
e.g allied, supplies 


only in iron pronounced /‘atjan/ (but the Scots 
pronunciation /‘atran/ has retained the /r/ and has 
more regular correspondences) 


only in island, isle(t), lisle, viscount. See Notes 

only in coyote 

only in duiker, Ruislip 

only word-final and only in bye, dye, lye, rye, Skye, stye 


only in final syllables and only in: (monosyllables) byte, 
chyle, chyme, cyme, dyke, dyne, gybe, gyve, hythe, 
hype, rhyme, scythe, skype, style, syce, syne, thyme, 
tyke, type; (polysyllables) acolyte, analyse, anodyne, 
azyme, breathalyse, catalyse, coenocyte, condyle, 
dialyse, electrolyse, electrolyte, enzyme, formaldehyde, 
leucocyte, neophyte and at least 14 other words 
ending in /fart/ spelt <-phyte>, paralyse, phagocyte, 
proselyte, spondyle, about 20 derivatives of style, 
troglodyte, and at least 20 derivatives of type. In my 
Opinion, all these words (except gyve) could be spelt 
with <i.e> without loss 


2-phoneme 
graphemes 


3-phoneme grapheme 
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/ata/ 


(1) only medially and mainly where <-e> has been 

spelt <ir> deleted from words in the next category, e.g. aspiring, 
desirous, expiry, spiral, tiring, but there are a few 
independent examples, e.g. biro, giro, pirate, virus. |n 
all cases the <r> is both part of <ir> spelling /ata/ 
and a grapheme in its own right spelling /r/. For dual- 
functioning see section 7.1 


(2) only word-final and only in ac/in/re-quire, admire, a/ 
spelt con/in/per/re/tran-spire, attire, desire, dire, empire, 
<ire> entire, (expire, fire, hire, (be/quag-)mire, quire, saltire, 


samphire, sapphire, satire, shire, sire, spire, e)squire, 
tire, umpire, vampire, wire. Many of these words allow 
/r/-linking, e.g. inspiration, satirical, spiral, wiring - 
see previous paragraph and section 3.6 


(3) only medial and only in empyrean, gyroscope, papyrus, 

spelt <yr> pyrites, pyromaniac, thyroid, tyrant, tyro, tyrosine. In 
all cases the <r> is both part of <yr> spelling /ata/ 
and a grapheme in its own right spelling /r/. For dual- 
functioning see section 7.1. Words in which <y, r> are 
separate graphemes include dithyramb(id), myriad, 
porphyr-y/ia, tyranny, syringa, syringe, syrup, all with 
the relevant <y> spelling /1/ 


(4) only word-final and only in byre, gyre, lyre, pyre, tyre. 
spelt In my opinion these words could be spelt with <ire> 
<yre> without loss, as tire already is in US English. Some of 


these words allow /r/-linking - see section 3.6 - e.g. 
pyromaniac, and (with change of vowel and <r> 
spelling only /r/) lyrical 


/wat/ only in foyer pronounced /'fwatjet/ (also pronounced 
spelt /‘fotjet, 'fatja/), voyeur 

<oy> 

/wata/ only in choir - one of only two 3-phoneme graphemes 


spelt with in the entire language 
a single 

grapheme 

<oir> 
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NOTES 


The regular spellings of /ar/ are: 
in non-final syllables, and in monosyllables before consonant clusters: 
<i> 
in monosyllables before /t/: <igh> 
in closed final syllables (except in monosyllables before consonant 
clusters and /t/): <i.e> 
word-finally: <y>. 
<i> is regular in non-final syllables (see section 6.3), but for: 
exceptions listed under the Oddities <a, ae, ai, aye, ei, ey, ir, is, oy, ui> 
and the 2-phoneme grapheme /wati/ spelt <oy>, above, plus Blighty, 
righteous, sprightly (also spelt spritely because of its derivation from 
Sprite) 
exceptions with <y>: asylum, aureomycin, cryostat, cyanide, cycle, 
cyclone, cypress, (hama)dryad, dynamic, forsythia, glycogen, 
gynaecology, hyacinth, hyaline, hybrid, hydra, hydrangea, hydraulic, 
hydrofoil, hydrogen and various other compounds of hydro-, hyena, 
hygiene, hygrometer, hymen, hyperbole and other compounds in 
hyper-, hyphen, hypothesis and other compounds in hypo-, lychee, 
myopic, nylon, psyche and almost all its derivatives (exception: 
metempsychosis, with /1/), pylon, stymie, thylacine, thymus, typhoid, 
typhoon, typhus, xylophone, zygote and derivatives. 
<i.e> is regular in closed final syllables of polysyllabic words, e.g. alive, 
archive, capsize, combine, concise, decide, entice, exercise, oblige, senile, 
sublime. Exceptions: see the Oddities listed above under <y.e>, plus alight, 
behind, delight, Fahrenheit, fore/hind/in-sight, indict, paradigm, remind, 
uptight, watertight and suffixed forms after <y>-replacement (section 6.5), 
e.g. allied, supplies. 
In closed monosyllables: 
<i> on its own appears to be regular before consonant clusters: child, 
Christ, mild, ninth, pint, whilst, wild and the group with /atnd/ spelt 
<-ind>: bind, blind, find, grind, hind, kind, mind, rind, wind (‘turn’). 
Possible extension: If the context were defined in terms of letters, aisle, 
climb, isle, lisle could be added. Exception under either definition: heist 
<igh> is regular before /t/: bight, blight, bright, fight, flight, fright, 
hight, knight, light, might, night, plight, right, sight, slight, tight, 
wight, wright. Exceptions: height, sleight; bite, cite, kite, mite, rite, 
site, smite, spite, sprite, quite, white, write; byte 
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<i.e> is regular around other single consonant phonemes, e.g. fine, 
hive, ice, knife, like, lime, mile, mine, pipe, prize, ride, rise, including 
the small group with /6/ spelt <th>: blithe, lithe, tithe, writhe. 
Exceptions: see the Oddities listed above under <y.e>, plus aisle, 
climb, isle, lisle (but for these four words see two paragraphs above), 
mic, Stein. 

In open final syllables of polysyllabic words <y> is regular, e.g. in 130+ 
words with the suffix <-fy>, e.g. beautify, and in ally, ap/com/im/re/sup- 
ply, defy, deny, descry, espy, July, multiply (verb), occupy, prophesy, rely. 
Exceptions: assegai, shanghai, aye-aye; a fortiori/posteriori/priori, alibi, 
alkali, alumni, alveoli, foci, fundi (plural of fundus), fungi, Gemini, gladioli, 
rabbi and a few more rare words. 

In open monosyllables the most frequent spelling is <y>: by, cry, dry, 
fly, fry, my, ply, pry, scry, shy, sky, sly, spry, spy, sty, try, why, wry, plus 
buy, guy (taking <bu, gu> to be digraphs spelling /b, g/); this set numbers 
20 words. Exceptions (which number 24): aye, eye, I; die, fie, hie, lie, pie, 
tie, vie; bye, dye, lye, rye, stye; high, nigh, sigh, thigh; and the Greek letter 
names chi, phi, pi, psi, xi. A possible subrule might say that <ie> is regular 
after a single consonant letter, but this is a very small set, containing only 
the seven words die, fie, hie, lie, pie, tie, vie, and setting up this rule would 
cause problems for the grapheme-phoneme correspondences of <ie>. 

The words aisle, island, isle(t), lisle, viscount are among the oddest 
in English spelling, with <(a)is> spelling /at/ and the <s> having no 
consonantal value. Isle, lisle might have yielded to an analysis with /at/ 
spelt <i.e> and the intervening <sI> spelling /l/ - but there is no other 
warrant for a grapheme <sI>, or for the ‘split trigraph’ <ai.e> which this 
analysis would have produced for aisle (whereas there is another warrant for 
the grapheme <ais>, in palais - see under /e1/, section 5.7.1). Also, this 
analysis would not have fitted the other words listed (or those with /i:/ spelt 
<is>, see section 5.7.2). 

/wata/ also has the 2-grapheme spellings <wire> in wire and <-uire> 
in (ac/re)quire. And /ata/ has 2-grapheme spellings, e.g. <-iar> in liar, 
<-ier> in drier, <-yer> in dryer, flyer, <-igher> in higher. A possibly 
useful sub-rule is that word-initial /data/ is always spelt <dia-> (derived 
from a Greek prefix) except in dire itself and diocese. 

The <y> in coyote, foyer, voyeuris both part of the digraph <oy> spelling 
/(w)at/ and also spells /j/ on its own. For dual-functioning see section 7.1. 
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5.7.4 /au/ as in oath 


THE MAIN SYSTEM 


For all these categories see Notes. 


Basic grapheme_ <o> 
Other frequent <o.e> 
graphemes 
<ow> 
THE REST 
Oddities 
<aoh> 
<au> 
<eau> 
<eo> 
<ew> 


59% regular in non-final 
syllables, e.g. focus, finally 
in polysyllables (except 

in two-syllable words 
after /l, r/), and in closed 
monosyllables before a 


consonant cluster 
16% 
(72% in 
monosyllables) 


regular in final closed 
syllables (except in closed 
mono-syllables before a 
consonant cluster), e.g. 
bone, remote 


18% regular finally in two- 
syllable words after /I, r/ 
and in open monosyllables 

8% in total 


only in pharaoh 


only in chauffeu-r/se, chauvinis-m/t, gauche, hauteur, 
mauve, saute, taupe 


only word-final and only in bandeau, beau, bureau, 
chateau, flambeau, gateau, plateau, portmanteau, 
rondeau, tableau, trousseau and a few other very rare 
words. For the plurals of these words see /z/, section 
3.6.7, and <x>, section 9.41 


only in Yeo, yeoman, Yeovil 


only in sew, sewn, Shrewsbury plus shew(ed), shewn 
(archaic spellings of showed), shown) 
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<oa> only in (non-final syllables) gloaming; (closed final 
syllables of polysyllables) approach, cockroach, 
encroach, reproach; (closed monosyllables) bloat, boast, 
boat, broach, cloak, coach, coal, coast, coat, coax, croak, 
float, foal, foam, gloat, goad, goal, goat, groan, groat, 
hoax, loach, load, loa-f/ves, loam, loan, loath, loathe, 
moan, moat, oaf, oak, oast, oat, oath, poach, roach, 
road, roam, roan, roast, shoal, soak, soap, stoat, throat, 
toad, toast, woad; (finally) cocoa, whoa 


<oat> only in boatswain pronounced /'bausan/ (also 
pronounced /'bautswein/) 


<oe> except in throes, only word-final and only in aloe, doe, 
floe, foe, hoe, oboe, roe, schmoe, sloe, toe, woe. See also 
sections 4.3.2 and 6.6 


<oh> only in doh, kohl, Oh, ohm, soh 


<ol> only in folk, Holborn, holm, yolk and old-fashioned 
pronunciation of golf as /gauf/ 


<oo> only in brooch 


<ore> only in forecastle pronounced /'fauksal/(also 
pronounced /'fo:ka:sal/) 


<os> only in apropos 


<ot> only in argot, depot, entrepot, haricot, jabot, matelot, 
potpourri, sabot, tarot, tricot. /t/ surfaces in sabotage, 
saboteur - see section 7.2 - where the <o> spells /a/ 


<ou> only in boulder, bouquet pronounced /bau'ker/ (also 
pronounced /bu:'ker/), mould, moult, poultice, poultry, 
shoulder, smoulder, soul 


<ough> only in dough, furlough, (al)though 
<owe> only in owe 


2-phoneme (none) 
graphemes 


NOTES 


<o> is regular in non-final syllables, and the only exceptions | can find 
in stem words are boulder, bouquet pronounced /bau'ke1/, chauffeu-r/se, 
chauvinis-m/t, hauteur, gloaming, poultice, poultry, shoulder, smoulder, 
yeoman, Yeovil - though there are many more in derived forms, e.g. mould- 
er/y, moult-ed/ing - see section 6.3. 
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For nouns ending in <-o> which do or do not add <es> in the plural 
see section 6.6. 

For ‘linking /w/’ and a few cases in which a preceding <o> reduces to 
/a/ see section 3.8.7. 

The group of stem monosyllables with final /auld/ spelt <-old> is one of 
only two cases where the spelling of the rime/phonogram is more predictable 
as a unit than from the correspondences of the separate phonemes, and 
there are enough instances to make the rule worth teaching; see section A.7 
in Appendix A. The only monosyllabic stem word exception in British spelling 
is mould, and even that is spelt mold in the USA. The pattern generalises 
to the polysyllables listed above, plus solder. But this rule applies only to 
stem words, and they would have to be clearly distinguished from the past 
tenses/participles doled, foaled, holed, poled, polled, rolled, tolled. 

For final syllables of stem words see Table 5.7. 


TABLE 5.7: SPELLINGS OF /au/ IN FINAL SYLLABLES OF STEM WORDS. 


N.B. The regular (default) spellings are shown in 9 point, exceptions in 7.5 


point. 


In polysyllables 


In monosyllables 


Closed 


<o.e>, e.g. chromosome, remote 


Extension: Just one word with a 
2-letter spelling of the word-final 
consonant phoneme, namely 
cologne 


Exceptions: approach, cockroach, 
encroach, reproach, control, enrol, extol, 
patrol. behold, cuckold, blind/mani-fold, 
marigold, scaffold, threshold (see also 
paragraph below Table); revolt; almost 


Before a consonant cluster: <o>, 
e.g. bold, cold, fold, gold, hold, old, 
scold, sold, told, wold (see also 
paragraph below Table); bolt, colt, 
dolt, jolt, volt; don’t, wont, won’t; 
ghost, host, most, post 


Exceptions: boast, coast, oast, roast, toast, 
mould (hence the more consistent US 
spelling mold), moult. Also exceptions in 
phonetic (but not orthographic) terms are 
coax, hoax, but cox, *hox or “coxe, *hoxe 
would not work, the first two because they 
would suggest the wrong vowel sound, the 
last two because <x> never occupies the 
‘dot’ position in split digraphs - see section 
A.6 in appendix A. 


Before a single consonant 
phoneme: <o.e>, e.g. bone (72% of 
spellings in monosyllables) 
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Extension: Five words with 

2-letter spellings of the word- 
final consonant phoneme, namely 
brogue, rogue, vogue, toque, clothe 


Exceptions: gauche, mauve, taupe; sewn, 
shew-ed/n, bloat, boat, broach, cloak, 
coach, coal, coat, croak, float, foal, foam, 
gloat, goad, goal, goat, groan, groat, loach, 
load, loaf, loam, loan, loath, loathe, moan, 
moat, oaf, oak, oat, oath, poach, roach, 
road, roam, roan, shoal, soak, soap, stoat, 
throat, toad, woad (for coax, hoax see 
above); kohl, ohm, folk, yolk; boll, droll, 
poll, roll, scroll, stroll, toll; holm, comb; 
both, loth, quoth, sloth, troth; brooch; soul; 
bowl, blown, flown, grown, known, mown, 
own, show-ed/n, sown, thrown 


Open 


In two-syllable words after /I, r/: 
<ow>, namely bellow, below, 
billow, callow, fallow, fellow, follow, 
hallow, hollow, mallow, mellow, 
pillow, sallow, shallow, swallow, 
tallow, wallow, whitlow, willow, 
yellow; arrow, barrow, borrow, 
burrow, farrow, furrow, harrow, 
marrow, morrow, narrow, sorrow, 
sparrow, yarrow 


Exceptions: aloe, cello, furlough, tableau; 
bureau, burro, pharaoh, tarot 


In longer words and other two- 
syllable words: <o>, e.g. gecko, 
gizmo, Leo, manifesto, potato, 
Scorpio, tomato, Virgo 


Exceptions: bandeau, chateau, flambeau, 
gateau, plateau, portmanteau, rondeau, 
trousseau; cocoa; oboe; apropos; argot, 
depot, sabot, tricot; although; bestow, 
bungalow, elbow, escrow, furbelow, 
meadow, minnow, shadow, widow, 
window, winnow 


<ow>, namely blow, bow (goes 

with arrow), crow, flow, glow, grow, 
know, low, mow, row (‘line, use 
oars’), show, slow, snow, sow (‘plant 
seed’), stow, throw, tow 


Exceptions: beau; sew, shew; fro, go, lo, 
no, so; whoa; doe, floe, foe, hoe, roe, sloe, 
throe, toe, woe; doh, soh, dough, though, 
owe 
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5.7.5 /ju:/ as in union 


The only 2-phoneme sequence to which | accord quasi-phonemic status - 


see sections 1.12 and 2.4. 


THE MAIN SYSTEM 


Basic graphemes <u> 


<u.e> 


Other frequent <ew> 
grapheme 


Rare grapheme  <ue> 


THE REST 


Oddities 
<eau> 


<eu> 


82% (with 
<u.e>) 


15% 


percentage 
not known - 
see Oddities 


regular in non-final syllables, e.g. 
pupil, union, word-final only in 
coypu, menu, ormolu, parvenu 


regular in closed final syllables, 
e.g. attribute, mute, use 


never initial; in non-final 
syllables, only in newel, Newton, 
pewter, steward, otherwise, only 
in final syllables and only in 
(closed) hewn, lewd, mews, newt, 
thews; (open) clerihew, curfew, 
curlew, few, hew, Kew, knew, 
mew, mildew, nephew, new, 
pew, phew, sinew, skew, smew, 
spew, stew, view, yew; from this 
(admittedly short) list, <ew> 
appears to be regular word- 
finally in monosyllables - see 
Notes 


appears to be regular word- 
finally in polysyllables - see Notes 


3% in total (including <ue>) 


only in beauty and derivatives 


only in various words of Greek origin, e.g. eucalyptus, 


eucharist, eudaemonic, eugenic, eulogy, eunuch, 


euphemism, euphorbia, euphoria, eureka, eurhythmic, 


euthanasia, leukaemia, neural, neurone, neurosis, 


Odysseus, Pentateuch, Perseus, pneumatic, pneumonia 


and other words and names. 


The phoneme-grapheme correspondences, 2: Vowels 215 


derived from Greek Trvebya pneuma (‘breath’) 

or TrvebWwv pneumon (‘lung’), pseudo and all its 
derivatives including (colloquial) pseud, therapeutic, 
Theseus, zeugma, plus various very rare words; plus 
(non-Greek) deuce, euchre, Eustachian, feu, feudal, 
neuter, neutr-al/on, teutonic, only instances in 
monosyllables are deuce, feu, feud, pseud; word-final 


only in feu 
<ewe> only in ewe, Ewell, Ewelme 
<ui> only in nuisance, pursuit 
<ut> only in debut. /t/ surfaces in debutante - see section 
7.2 
<uu> only in vacuum pronounced /'vekju:m/ 
2-phoneme All the graphemes in this section are 2-phoneme graphemes 
graphemes 
NOTES 


All the spellings listed above are used to spell /jur/. <eau, ewe, ut, uu> 
are used only to spell /ju:/, while the rest are used to spell both /ju:/ and 
/ur/ - see next section. 

<u> is the regular spelling of /ju:/ (and /ur/) in non-final syllables (see 
section 6.3), e.g. pupil, union. Exceptions: see the polysyllables listed under 
<ew> and the Oddities <eau, eu, ui> above. 

How can <u> function as the regular spelling of both /jux/ and /u:/ 
in non-final syllables without causing confusion? Because there are hardly 
any minimal pairs, words kept apart in meaning solely by the presence or 
absence of /j/. The only pairs I’ve been able to find in non-final syllables 
are beauty/booty, bootie (but not bootee, with stress on second syllable) 
and pewter/Pooter - and note that none of these words has <u> as the 
relevant spelling (and the last word is an invented surname). Similarly, | 
can find no minimal pairs separated only by the presence or absence of 
/j/ in the final syllables of polysyllables, and only a few such pairs/sets 
among monosyllables, namely beaut (Australian slang), butte/boot; cue(d/s), 
queue(d/s), Kew/coo(ed/s); cute/coot; dew, due when pronounced /dju:// 
do; ewe, yew, you/Ooth)!, feud/food, few/phoo; hew(s), hue(s), Hugh(es/’s)/ 
who(se); hewn/Hoon; Home pronounced /hjurm/, Hu(l))me/whom; lewd/ 
looed, lieu/loo; mew/moo, mewed/mood; mews, muse/moos, mute/moot,; 
pews), Pugh(s)/poo(s), Pooh(’s); pseud/sued (to me, these are /sjurd, su:d/ 
respectively, though for many speakers they are homophones, in one 
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pronunciation or the other); puke/Pook; pule/pool, use (verb)/ooze. Some 
people with Welsh accents distinguish threw, through as /®rju: , Oru:/, but 
for most speakers these are both /@ru:/. Again, it is noticeable that none of 
these words has <u> as the relevant spelling, though some have <ue, u.e>. 

The names Hugh, Hughes, Lamplugh, Pugh are the only words containing 
the grapheme <ugh> (the exclamation ugh contains two graphemes, <u, 
gh>), but because it occurs only in names | have not added <ugh> to the 
inventory of graphemes. 

<u.e> is regular in closed final syllables of polysyllables, e.g. attribute 
(only exceptions: pursuit, vacuum) and in closed monosyllables, e.g. mute, 
use (exceptions: deuce, feud; hewn, mews, newt, thews; Ewell, Ewelme). 

<ue> is only word-final and found only in (monosyllables) cue, hue, 
queue; (polysyllables) ague, argue, avenue, barbecue, continue, curlicue, 
ensue, imbue, pursue, rescue, retinue, revenue, revue, value, venue. Despite 
the shortness of the list just given, <ue> appears to be the regular spelling 
in word-final position in polysyllables (exceptions: curfew, curlew, mildew, 
nephew; coypu, menu, ormolu), and therefore qualifies as part of the main 
system. Also, as a grapheme <ue> has only two pronunciations (see section 
10.37), and one of them is /jur/. 

However, in word-final position in monosyllables <ew> appears to be 
regular (see the list above). Exceptions: ewe; cue, hue, queue. 

/ju:/ also has 2-grapheme spellings, e.g. in adieu, view, yew, you where 
/j/ is spelt <i, y> - see under /j/, section 3.7.8, and /ur/, next. But the 
l-grapheme spellings listed above predominate, especially <u>. Here, /j/ 
is not spelt separately but subsumed in the 2-phoneme spelling. 


5.7.6 /u:/ as in ooze 


THE MAIN SYSTEM 


For all these categories see Notes. 


Basic grapheme <oo> 39% e.g. ooze, booze, zoo. Regular in 
closed monosyllables, e.g. zoom 
and about 80 other words; also 
regular in polysyllables both 
word-finally, e.g. bamboo, and 
in the stressed ending /‘urn/, 
e.g. afternoon, baboon, rare 
elsewhere 


Other frequent 
graphemes 


Rare grapheme 


Frequent 
2-phoneme 
sequence 


THE REST 


Oddities 


<u> 


<u.e> 


<O> 


<ew> 


<ue> 
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27% (with regular in non-final syllables, e.g. 
<u.e>) rudiments, super 


regular in closed final syllables 
of polysyllables, e.g. intrude, 
recluse 


15% only in 11 stem words and their 
derivatives - see Notes. Carney’s 
percentage excludes do, to, who, 
which would distort the figures 


9% regular word-finally in 
monosyllables, e.g. blew 


<1% except for gruesome, muesli and 
Tuesday pronounced /'tf{u:zdi:/, 
only word-final and only in 
accrue, blue, clue, construe, (resi/ 
sub-)due, flue, glue, imbrue, issue, 
rue, Slue, sprue, sue, tissue, true. 
See Notes 


/jux/, with 10 spellings - see previous section 


<ee> 


<eu> 


<ieu> 


<oe> 


<0.e> 


<oeu> 


<ooh> 


10% in total 


only in leeward pronounced /'lu:wad/ (also 
pronounced /'li:wad/) 


only in rheum(ati-c/sm), sleuth, plus adieu, lieu, 
purlieu pronounced /a'djus, |jur, 'p3:ljur/ with <i> 
spelling /j/ (lieu is also pronounced /lu:/) 


only in lieu pronounced /lu:/ (also pronounced/Iju:/) 
only in canoe, hoopoe, shoe 


only in combe, lose, move, prove, whose /ku:m, 
lurz, muy, prurv, hurz/ and gamboge pronounced 
/gzm'bu:3/, plus derived forms. See Notes 


only in manoeuvre 


only in pooh 


218 Dictionary of the British English Spelling System 


<ou> 7% only in 
(in non-final syllables) accoutrement, acoustic, 
bivouac, boudoir, boulevard, bouquet, boutique, 
carousel, cougar, coupon, coulomb, coulter, coupe, 
coupon, croupier, crouton, embouchure, goujon, 
goulash, insouciance, louvre, moussaka, oubliette, 
outré, ouzo, pirouette, rouble, roulette, routine, 
silhouette, soubrette, soufflé, souvenir, toucan, 
toupee, troubadour, trousseau, voussoir 
(in closed final syllables) ampoule, barouche, 
canteloupe, cartouche, (un)couth, croup, douche, 
ghoul, group, joule, mousse, recoup, rouge, route, 
soup, troupe, wound (‘harm’) 
(finally) bayou, bijou, caribou, frou-frou, marabou, 
sou, you 


<oue> only in denouement, moue 


<ough> only in brougham, through 


<oup> only in coup 
<ous> only in rendezvous 
<out> only in mange-tout, ragout, surtout 
<oux> only in billet-doux, roux 
<ui> only in bruise, bruit, cruise, fruit, juice, recruit, 
Sluice, suit 
<uu> only in muumuu (twice) 
Other 2-phoneme graphemes (none) 


NOTES 


On <oo> see also Notes under /u/, section 5.4.6. 

All the spellings listed above are used to spell plain /u:/. Those beginning 
with <o> are used only to spell plain /u:/, while those beginning with <e, 
u> (except <ee>) are also used to spell /jux/ - see previous section. 

No rules can be given for when /u:/ is spelt <o> because it occurs 
only in the following 11 stem words: (monosyllables) do, to, tomb, two, 
who, womb; (polysyllables) caisson pronounced /ka'su:n/, canton (‘provide 
accommodation’, pronounced /kzn'tuin/), catacomb, lasso, zoology, plus 
derivatives including cantonment, lassoing, whom, derivatives of zoology 
with initial <zoo-> (Greek, ‘living thing’) forming two syllables pronounced 
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/zu:'wo/ if the second syllable is stressed, otherwise /zu:wa/, derived 
forms of the very few words in which /ur/ is spelt <o.e> (see Oddities), 
e.g. approval, movie, removal, and the proper nouns Aloysius /zlu:'wifas/, 
Romania, Wrotham /'ruxtam/. 
<u> is the regular spelling of /ux/ (and /ju:/) in non-final syllables, e.g. 
rudiments, super - see section 6.3. Exceptions (in addition to derivatives 
of words with /ur/ spelt <o, 0.e>, e.g. cantonment, lassoing,; ap/dis/ 
im/re-prove, approval, movie, remove, and among the Oddities above): 
brewer, jewel, sewage, sewer (‘foul drain’); bazooka, booby, boodle, boogie, 
boomerang, booty, canoodle, coolie, doodle(bug), googly, hoodoo, hoopoe, 
kookaburra, loony, moolah, noodle, oodles, oolong, poodle, voodoo, plus 
Aloysius, Romania, Wrotham. For how <u> functions as the regular spelling 
of both <ur> and <jur> in non-final syllables, see previous section. 
In closed final syllables of polysyllables: 

the stressed ending /urn/ is mostly spelt <-oon>: afternoon, baboon, 

bassoon, buffoon, cartoon, cocoon, doubloon, dragoon, festoon, 

harpoon, lagoon, lampoon, macaroon, maroon, monsoon, octaroon. 

Exceptions: caisson pronounced /ka'surn/ (also pronounced /'ketsan/), 

canton (‘provide accommodation’, pronounced /kzn'tu:n/) 

otherwise the regular spelling is <u.e>, namely in include, intrude 

and various other words in <-clude, -trude>, plus peruque, abstruse, 

recluse, peruse, brusque /bru:sk/ (also pronounced /brask/), etc. For 

exceptions see Oddities, plus vamoose. 
Exceptions to the rule that <oo> is the regular spelling in closed 
monosyllables are: 

with <u.e>: spruce, truce; ruche; crude, prude, rude; luge; fluke; rule, 

tulle; brume, flume, plume; prune, rune; jupe; ruse; brute, chute, flute, 

jute, lute 

others: rheum, sleuth; shrewd, strewn, tomb, whom, womb, combe, 

lose, move, prove, whose; croup, douche, ghoul, group, joule, louche, 

mousse, rouge, route, soup, troupe, wound (‘harm’), youth; ruth, truth; 

bruise, bruit, cruise, fruit, juice, sluice, suit. 
In word-final position the most frequent spellings are <-oo> in polysyllables, 
<-ew> in monosyllables: 

polysyllables: ballyhoo, bamboo, buckaroo, cockatoo, cuckoo, 

didgeridoo, hoodoo, hullaballoo, kangaroo, kazoo, shampoo, taboo, 

tattoo, voodoo. Exceptions: adieu, purlieu; cashew, eschew, purview, 
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review, lasso; caribou, marabou; ecru, guru, jujitsu, juju, impromptu; 
accrue, construe, imbrue, issue, residue, subdue, statue, tissue, virtue 
monosyllables: brew, chew, crew, dew pronounced /dgu:/, Jew, screw, 
shrew, strew, view, yew and the irregular past tenses blew, drew, flew, 
grew, slew, threw. Exceptions: lieu (/lu:/); do, to, who; boo, coo, goo, 
loo, moo, shoo, too, poo, woo, zoo; sou, you; flu, gnu (taking <gn> as 
spelling /nj/, though gnu could alternatively be analysed as having 
(like gnat, gnaw, etc.) /n/ spelt <gn> and /ju:/ spelt <u> - take your 
pick); blue, clue, due pronounced /dgu:/, flue, glue, rue, sue, true. 
Despite the rarity of <ue> as a spelling of /ur/ it has to be counted as part 
of the main system because as a grapheme (see section 10.37) it has only 
two pronunciations, both frequent, and one of them is /ur/. 


6. Some spelling rules for 
vowels 


It is notoriously the case that English vowel spellings are much less 
predictable than consonant spellings (compare chapters 5 and 3), so in 
this chapter | provide some guidance on this - but be warned (again): the 
guidance doesn’t and can’t cover every word, so | end up saying ‘The rest 
you just have to remember’. Such (relatively) easy bits as there are for vowel 
spellings are summarised at the beginning of chapter 5. 


6.1 ‘<i> before <e> except after <c>’ 


This is the only spelling rule most British people can recite. Stated as baldly 
as that it is thoroughly misleading. A letter in Times Higher Education in the 
summer of 2008 (Lamb, 2008) provided a more nuanced formulation: 


‘<i> before <e> except after <c> if the vowel-sound rhymes with bee’. 


The qualification ‘if the vowel-sound rhymes with bee’ (or similar) is hardly 
ever mentioned, perhaps because it is difficult to explain to children - but 
let us explore it. 

In order to use the expanded rule, writers have first to realise that an /i:/ 
phoneme they wish to spell needs to be written with one of the graphemes 
<ei, ie> and not with any of the other possibilities - not necessarily an 
easy matter (a quick look at section 5.7.2 will reveal that there are 15 ways 
of spelling /i:/ in English besides <ie, ei>, some admittedly very rare). If 
they do realise they must choose between <ei> and <ie>, they will find 
that the expanded rule works pretty well for ‘<i> before <e>’ (= not after 
<c>): there are at least 90 words with /i:/ spelt <ie>, and only two of 
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these are exceptions to the rule: specie, species. But it works very poorly for 
‘<e> before <i> after <c>’: the only words that conform to it are ceiling, 
conceit, conceive, deceit, deceive, perceive, receipt, receive, and exceptions 
are more numerous: caffeine, casein, codeine, cuneiform, disseisin, heinous, 
inveigle, Keith, plebeian, protein, seize, plus either, leisure, neither in their 
US pronunciations, and counterfeit if you pronounce it to rhyme with feet. 

| suppose you could count all these words together and say that the rule 
works for about 90 per cent of them - but the second half of the rule is weak, 
and writers are mostly left with no guidance on the myriad other words 
in which <ei> and <ie> occur without rhyming with bee - for examples 
see sections 10.12 and 10.23 (especially the set of words containing the 
sequence ‘cie’ which naive spellers who forget the ‘when the vowel-sound 
rhymes with bee’ condition may well be confused about: ancient, coefficient, 
conscience, conscientious, deficiency, deficient, efficiency, efficient, 
omniscience, omniscient, prescience, prescient, proficiency, proficient, 
science, scientific, society, sufficient, sufficiency) - or in which /ix/ is not 
spelt either <ie> or <ei>. In my opinion, this rule should be consigned to 
oblivion. 


6.2 ‘To spell the names of letters <a, i, 0, u> 
in one-syllable words ending with a 
single consonant phoneme, write the 
vowel-name letter and the consonant 
letter and magic <e>’ 


This fact is well known, but not often expressed like this. Examples are 
too numerous and familiar to need listing. The rule holds good about 
three-quarters of the time for relevant monosyllables. There are about 60 
‘letter-name-vowel except /i:/ plus single consonant’ endings in English 
monosyllables, and this rule works well for all but a handful of them. For 
example, the only word ending /aup/ and spelt with <-oap> is soap - all 
the rest are spelt with <-ope>, including scope, slope. The main exceptions 
are that /erl, e1n/ are spelt <-ail, -ain> about as often as they are spelt 
<-ale, -ane> (see section 5.7.1), and that the principal spelling of /art/ is 
<-ight> (see section 5.7.3). 

The rule also applies, but less strongly, to the final syllables of 
polysyllabic words where the full letter-name sounds including /i:/ occur, 
and regardless of whether the syllable is stressed or unstressed. 
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There are two important limitations: it doesn’t apply to phoneme /i:/ 
in monosyllables, and all words containing /at/ spelt <y.e> (see section 
5.7.3) are exceptions. So it could be stated more exactly (but less usefully 
for teaching purposes) as: 


‘In words ending in a single consonant phoneme, spell letter-name vowels 
(EXCEPT /i:/ in monosyllables) with their name letters plus the consonant 
letter plus magic <e> (and watch out for words spelt with <y> and magic 
<e>).’ 


In monosyllables ending in a consonant very few occurrences of /i:/ are spelt 
<e.e>, and the main spelling of /ix/ is <ee>, but there are many exceptions 
- and even more, numerically, in the final syllables of polysyllabic words 
even though there <e.e> is the most frequent pattern (see section 5.7.2). 


6.3 ‘In non-final syllables of stem words, 
spell letter-name vowels with their 
name letters’ 


This is my generalisation of various regularities stated in sections 5.1 and 
5.7: the letters <a, e, i, o, u> are the regular spellings of phonemes /e1, i:, 
ai, au, jur/, plus /u:/, in non-final syllables, that is, outside one-syllable 
words and the final syllables of polysyllabic words. The rule applies to both 
stressed and unstressed syllables where the full letter-name sounds occur. 
Long lists of examples can be found in Wijk’s Rules of Pronunciation for the 
English Language, especially pp.19-20, 22-26, 69, 73. A few representative 
examples (in stressed syllables before the semi-colons; in unstressed 
syllables after them) are: 
/eI/ spelt <a>: agent, baby, bacon, capable, crustacean, danger, 
data, hazel, ingratiate, insatiable, labour, lady, loquacious, nation and 
all the other words ending in /'eifan/, plagiarism, stranger, wastrel; 
fatalistic 
/it/ spelt <e>: amenable, appreciable, decent, diabetes, European, 
frequent, idea, Leo, lever, medieval, museum, neon, oedema (second 
syllable), penalise, pleonasm, region, senior, sequence, species, 
theatre; abbreviation, area, galleon, geographic, hideous and about 
80 others ending in <-eous>, nucleus, petroleum 
/at/ spelt <i>: annihilate, bicycle, climate, dialogue, diaphragm, 
disciple, giant, hierarch, inviolable, liable, library, lion, rival, siphon, 
violence; criterion, diabetes, diarrhoea, gigantic, idea, iota 
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/au/ spelt <o>: diplomacy, focus, iota, lotion and all the other words 
ending in /'‘aufan/ (including ocean itself, despite the rest of its 
spelling), molten, motor, negotiable, ocean, ochre, only, profile, rosy, 
sociable, swollen, coerce, cryostat, Eloise, grotesque, loquacious, obese 
/ju:/ spelt <u>: alluvial pronounced /a'lju:vitjal/, computer, numerous, 
peculiar, reducible, stupid, unit; intuition pronounced /t1ntju:'w1fan/ 
/u:/ spelt <u>: alluvial pronounced /a'lurvisjal/, inscrutable, 
judo, lunatic, scrutiny, suicide; fluorescent, intuition pronounced 
/Int{ur'wifan/, judicial, superior. 
There are of course exceptions to all of these, e.g. 
/er/ not spelt <a>: Gaelic pronounced /'gerl1k/, maelstrom; aileron, 
caitiff, complaisant, daisy, gaiter, liaison, maintain, maintenance, 
raillery, raisin, traitor, wainscot; bayonet, cayenne, crayon, layer, 
layette, maybe, mayonnaise, rayon; debacle, debris, debut(ante), 
decolletage, decor, denouement, detente, eclair, elan, elite, ingenu(e), 
menage, precis, regime, séance, ukulele; heinous pronounced 
/‘hetnas/, obeisance, reindeer, neighbour, abeyance, heyday, laissez- 
faire, rendezvous 
/ix/ not spelt <e>: 
1) Exceptions with <i> (there are at least 1000 words in this category 
- see under /ix/, section 5.7.2): 
(stressed) albino, ballerina, casino, cliché, concertina, farina, 
kilo, lido, litre, maraschino, merino, mosquito, ocarina, piquant, 
scarlatina, semolina, visa; first <i> in kiwi, migraine; second <i> in 
bikini, incognito, libido 
(unstressed) ap/de-preciate, associate, audio, calumniate, caviar, 
foliage, luxuriate, mediaeval (second syllable), negotiate, orient, 
oubliette, patio, radio, ratio, serviette, studio, trio, verbiage; 
also first <i> in conscientious, liais-e/on, orgiastic, partiality, 
psychiatric, speciality, second <i> in inebriation, insomniac, 
officiate, superficiality, vitriolic, third <i> in initiate 
2) Other exceptions: aegis, aeon, aesthete, anaemi-a/c and other 
words ending /‘i:mta, ‘izmik/ spelt <-aemi-a/c>, anaesthetist, 
archaeology, Caesar, encyclopaedia (fourth syllable), faeces, 
haemoglobin, mediaeval (third syllable), naevus, praetor, quaestor, 
beacon, beadle, beagle, beaker, beaver, creature, deacon, eager, 
eagle, easel, Easter, easy, feature, heathen, meagre, measles, 
queasy, reason, season, sleazy, squeamish, teasle, treacle, treason, 


Some spelling rules for vowels 225 


weasel. beetle, cheetah, feeble, freesia, gee-gee, geezer, needle, 
Squeegee, sweetie, teeter, wheedle, ceiling, cuneiform, heinous 
pronounced /'hi:nas/, inveigle; feoffee, feoffment, people; geyser 
pronounced /'gi:za/; amoeba, coelacanth, coelenterate, coeliac, 
coelom, foetal, foetid, foetus, oedema (first syllable), oenology, 
oesophagus, oestrogen, oestrus; phoenix, subpoena; caryatid, 
embryonic), halcyon, polyandry, polysyllable, polytechnic. \n US 
spelling many of the words just listed with <ae, oe> are instead 
spelt with <e>, thus conforming to the rule 

/at/ not spelt <i>: 

1) Exceptions with <y>: asylum, aureomycin, cryostat, cyanide, cycle, 
(hama)dryad, dynamic, forsythia, glycogen, gynaecology, hyacinth, 
hyaline, hybrid, hydra, hydrogen, hyena, hygiene, hygrometer, 
hymeneal, hyperbole and other compounds in hyper-, hyphen, 
hypothesis and other compounds in hypo-, lychee, myopic, nylon, 
psyche and all its derivatives, pylon, thylacine, thymus, typhoid, 
typhoon, typhus, xylophone, zygote and derivatives 

2) Other exceptions: naive, papaya; maestro, balalaika, Kaiser, 
naiad; aye-aye; deictic, deixis, eider, eidetic, eirenic, either, feisty, 
kaleidoscope, meiosis, neither, seismic; geyser pronounced /'gaiza/; 
blighty; iron, island, islet, viscount; coyote, foyer pronounced 
/‘fwatjer/, voyeur; duiker, Ruislip 

/au/ not spelt <o>: chauffeu-r/se, chauvinis-m/t, hauteur, yeoman; 
gloaming; boulder (contrast the comparative adjective bolder), bouquet 
pronounced /bau'ker/, poultice, poultry, shoulder, smoulder 
/ju:/ not spelt <u>: beauty; feudal, leukaemia, neurosis, pseudo; 
skewer, nuisance 
/ux/ not spelt <u>: leeward pronounced /'lu:wad/; pleurisy, rheumatism, 
brewer, jewel, sewage, sewer (‘foul drain’); approval, movie and other 
derivatives of words with /ur/ spelt <o.e>; manoeuvre; bazooka, 
booby, boodle, boogie, boomerang, booty, canoodle, coolie, doodle(bug), 
googly, hoodoo, hoopoe, loony, moolah, noodle, oodles, poodle, voodoo; 
accoutrement, acoustic, boudoir, boulevard, bouquet, boutique, carousel, 
coulomb, cougar, coupon, croupier, goulash, insouciance, louvre, 
moussaka, oubliette, outré, ouzo, rouble, roulette, routine, silhouette, 
soubrette, soufflé, souvenir, toucan, toupee, troubadour, trousseau, 
voussoir, denouement, gruesome, muesli, Tuesday. 
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But the generalisation seems mainly sound for stem words. It is particularly 
strong for /au, jur/ spelt <o, u>; the only exceptions I’ve been able to 
find are the 13 and 7 respectively just listed. It’s weakest for /ix/ spelt 
<e>, where there are over 1000 exceptions, and there are of course other 
instances in derived forms, e.g. (to name just a few) mould-er/y, moult-ed/ing, 
fewer, hewer. 

For how the <e>-deletion rule makes many derived forms conform to 
this rule see the next section. 


6.4 <e>-deletion (Part 2 of ‘double, drop or 
swop’) 


For ‘double, drop or swop’ see also section 4.2 and the next section. 
The main rule for dropping a word-final letter <e> when adding a suffix 
is easily stated: 


In words which end in <e> preceded by a consonant letter, drop the <e> 
before suffixes beginning with a vowel letter. 


Examples: arousal, arrival, assemblage, baker, behaviour, chaplain, 

collegial, convalescent, debatable, drudgery, forcible, hated, muscly (}), 

revival, rousing, storage, surety, treasury, wiry, writing. 

Note that: 

when the suffix begins with <e> (past tense and participle <-ed>, 
agentive or comparative adjective-forming <-er>, superlative 
adjective-forming or archaic second person singular person tense 
ending <-est>, archaic third person singular person tense ending 
<-eth>, verb singular or noun plural <-es>, adjective-forming 
<-ent>, noun-forming <-ery, -ety>), technically the <e> of the stem 
is dropped and replaced by the <e> of the suffix, even though it looks 
simply as though <d, r, st, th, s, nt, ry, ty> has been added. A few 
quite odd words belong here, e.g. bizarrery, freer, freest, weer, weest 
(comparative and superlative forms of the adjectives free, wee), freest, 
freeth, seest, seeth (/'frisjist, ‘frisj10, 'si:jrst, 'sizjr1@/, archaic second 
and third person singular present tense forms of the verbs free, see), 
sightseer /'sattsitja/. In the words containing <e, e> those two letters, 
unusually, do not form a digraph; 
<e>-deletion makes many words in which it applies (including 
arrival, collegial, debatable, hated, revival and writing) conform to 
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the generalisation in the previous section about letter-name vowels in 
non-final syllables. Thus in the spelling ageing, now probably more 
frequent than aging, the <e> is strictly speaking unnecessary, as is 
the first <e> in mileage (also spelt milage); 
even more unnecessary is the <e> in axeing, which is starting to appear 
but should definitely be axing, the only possible form in US spelling 
where the unsuffixed form is ax; in 2012 | also noticed apeing, which 
should be aping. Similarly, the (US?) spelling knowledgeable strictly 
speaking has an unnecessary <e>, since knowledgable conforms 
better to the general <e>-deletion rule. 
Exceptions: Where the consonant letter preceding word-final <e> is <c, g> 
forming a digraph with the <e> spelling /s, d3/ (whether or not the <e> is also 
part of a split digraph), the <e> is retained in order to show that <c, g> are 
not pronounced /k, g/, for example in noticeable, peaceable, pronounceable, 
serviceable, traceable; advantageous, changeable, chargeable, damageable, 
manageable, marriageable, outrageous (therefore not “noticable, *peacable, 
*“pronouncable, *servicable, *tracable; ‘advantagous, ‘“changable, *chargable, 
“damagable, “managable, “marriagable, “outragous). The <e> is also 
retained in routeing, singeing, swingeing, whingeing to avoid confusion 
with routing (from rout), singing, swinging and winging; also in bingeing, 
cringeing, sponge-ing/y to avoid suggesting the existence of stem words 
“to bing, “to cring, "to spong - though there was of course Bing (Crosby), 
and Spong is a rare but real surname. Also, the <e> in acreage, (a/un-) 
bridgeable, ogreish, ochreous, saleable, unshakeable is never deleted. 
Conversely, when <-or> is added to mortgage, which should produce the 
spelling “mortgageor, the result is instead mortgagor, thus both breaking 
what we might call the ‘<e>-retention’ rule as it applies to words ending in 
/d3/ spelt <ge> and producing one of the few words in which <g> before 
<o> is pronounced /dz/ (see section 9.15). And the <e> of more is retained 
in moreover. 

The past tense form recceed is very odd, not just visually (my spellchecker 
tried to change it to recede) or because of the irregular spelling of /k/ as 
<cc> before <e>, but also because the stem-final <e> isn’t deleted before 
<-ed>. If it were, “recced would look as though it was pronounced /rekt/, 
like wrecked. Similarly, the participle recceing also has to retain the <e> to 
spell /iz/. 

The adjective fiery is always so spelt, and never ‘firey or “firy. There 
appear to be no words ending <irey>, but ‘firy might seem a more logical 
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application of the <e>-replacement rule -perhaps there is a feeling that, 
because fiery is pronounced /'‘fatjari:/, the schwa needs to be represented. 
Extensions: 
Adjectives ending in <-able, -ible> drop the <e> and add <y> to 
form adverbs ending -ly, e.g. probably, visibly. 
(Almost all other adverbs add <-ly> but this would produce, e.g., 
*“probablely, “visiblely which might suggest the presence of a non- 
existent schwa vowel corresponding to the <e>; and omitting the <e> 
from those forms would produce “probablly, “visiblly, which would go 
against the tendency to reduce <Il> to <I> - see section 4.4.7) 
Whole loses the <e> when <-ly> is added: wholly (though some 
dictionaries list the form wholely, contrast solely, which is never spelt 
*solly). 
A few words optionally lose the <e> before <-ment>: abridg(e)ment, 
acknowledg(e)ment, judg(e)ment, and argument always does. 
Where loses an <e> in wherever (but not in whereas, whereat, 
whereupon). 
Nine loses the <e> before <-th>, and while before <-st>: ninth, 
whilst (presumably so that they will not look as though they have two 
syllables, like archaic third or second person singular present tense 
verbs: “nineth, “whilest). 
One noun ending in <-ue> drops the <e> when adding <-ery> to 
form a derived noun: demagoguery. 
Adjectives ending in <-ue> drop the <e> when adding <-ly> to form 
adverbs: duly, truly, and true loses the <e> in truism, but blueish 
keeps it. 
A few verbs ending in <-ue> lose the <e> before <-able>: arguable, 
issuable, rescuable, subduable, suable, valuable. These six words 
(plus changeable, debatable, saleable and serviceable) all fit the 
generalisation about <-able> versus <-ible> (see section 6.7). 
The extensions, noted in the last three bullet points above, of <e>-deletion 
to some suffixations of words ending in <-ue> never apply where the 
letter before the <e> is a vowel letter other than <u>, e.g. hoeing. Even 
where the preceding vowel letter is <u> some words can retain or drop the 
<e> before <-ing>, e.g. cuing/cueing, queuing/ queueing, but most other 
words with <ue> always drop the <e>, e.g. arguing, burlesquing, issuing, 
rescued, subdued, suing, valued. 
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However, where the stem ends in <-gue> the position is complicated. If 
the suffix begins with <e, i> AND the pronunciation of the <g> remains /g/ 
after suffixation only <e> is deleted, e.g. catalogued, intriguing, voguish 
(though | have also seen vogueish in print); but both letters <-ue> are 
deleted if either the pronunciation of the <g> changes to /d3/ after suffixing, 
e.g. analogy, dialogic, ideological, or the suffix begins with <a>, e.g. fugal, 
vagary (N.B. ‘vaguery does not appear to exist). And then there is the group 
of 3 words ending in <-ngue> spelling /n/: harangue, meringue, tongue 
retain the <u> in haranguer, haranguing, meringued, tongued, presumably 
to prevent the <ng> appearing to spell /nd3/: “haranger, “haranging, 
“meringed; in the case of forms of harangue, also to prevent the second 
<a> looking as though it spells /er/; and in the case of tongue, to avoid 
confusion with derivatives of tong, e.g. tonged. 


6.5 <y>-replacement (Part 3 of ‘double, 
drop or swop’) 


For ‘double, drop or swop’ see also sections 4.2 and 6.4. 
The rule for replacing a word-final letter <y> with <i> when adding a 
suffix is more complicated than that for <e>-deletion: 


DON’T change the <y> if the preceding letter is a vowel letter, e.g. 
playing, or if the suffix is <-ing>, e.g. crying. 

Otherwise, in words which end in <y> preceded by a consonant letter: 

1) change the <y> to <ie> before <-s>, e.g. tries; 

2) change the <y> to <i> before other suffixes, e.g. tried. 


Extensions and exceptions: 

The ‘multiples-of-ten’ numerals from twenty to ninety change the <y> 
to <i> before <-eth>: twentieth, thirtieth, etc. 

A few examples of <-y> changing to <i> before <-a> might be 
considered extensions, e.g. porphyry, porphyria. 

There seems to be only one word where <y> after a vowel letter 
exceptionally is deleted before a suffix beginning with a vowel letter: /aity. 

There seem to be only four words where <y> changes exceptionally to 
<e>: beauteous, duteous, piteous, plenteous. 

Where the preceding letter is a consonant letter, most words change 
<y> to <i> before <-ful, -fy, -hood, -less, -ly, -ment, -ness, -some, 
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-work>, e.g. beautiful, bountiful, dutiful, fanciful, merciful, pitiful, plentiful, 
beautify, dandify, glorify, jollify, ladify, mummify, prettify, likelihood, 
livelihood; merciless, penniless, crazily, drily (also spelt dryly), greedily, 
wittily, accompaniment, embodiment, merriment, business (‘enterprise’, 
pronounced /'biznis/), foolhardiness, spiciness, weightiness, wearisome; 
handiwork. But bellyful, babyhood, shyly, slyly, wryly, busyness (‘state of 
being busy’, pronounced /'bizi:nis/), dryness, shyness, slyness, wryness, 
bodywork keep the <y>. Several other apparent exceptions to this paragraph 
(joyful, playful, joyless, coyly, greyly, coyness, greyness, glueyness) 
are obeying the part of the rule that says ‘Don’t change the <y> if the 
preceding letter is a vowel letter’. A great oddity is multiplication, which 
could be (mischievously) analysed as a derived form of the verb multiply 
with an otherwise unknown suffix <-cation> - in May 2009 I came across 
an instance of a child reported as writing “multiplycation. 

The adjective and adverb daily, the adverb gaily, the past tenses and 
participles laid, paid and the past participles Jain, slain have <i> despite 
day, gay, lay (present tense of laid, past tense of lie ‘be horizontal’), pay, 
slay having a vowel letter before the <y>; the regular spellings of daily, 
gaily, laid, paid would be “dayly, “gayly, “layed (and on 30 June 2010 | saw 
the form ‘overlayed in an exhibition caption at the British Library), “payed. 
The irregularity in laid, paid consists not just in changing <y> to <i> but 
also in omitting the <e> of the regular past tense and participle ending 
<-ed>. It is more difficult to work out what the ‘regular’ spellings of /ain, 
slain would be. They are irregular past participles formed with the ending 
usually written <-en> (e.g. broken, written), but they also seem to be 
the only cases where this ending is added to stems ending in <-y>. The 
spellings “layen, “slayen would, however, look disyllabic even though the 
words are monosyllables - the few occurrences of medial /e1/ spelt <ay> 
are all in polysyllables, whether stem or compound words (see /e1/, section 
5.7.1) - but on the other hand “layn, “slayn would spell medial /e1/ with 
<ay> when this correspondence never occurs in monosyllables. So perhaps 
lain, slain are logical after all. 

The nouns fryer (as in deep-fat-) and dyer (‘person who dyes’) always 
have <y>. The noun meaning ‘thing that dries’ (as in hair-, hand-, tumble-) 
can be spelt drier or dryer, but the adjective meaning ‘more dry’ must be 
drier. The nouns meaning ‘an aircraft pilot, a handbill’, etc., can be either 
flier or flyer, but the adjective meaning ‘more knowing and clever’ is always 
flyer (and why this differs from the adjective drier escapes me). 
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6.6 <ie>-replacement, <y>-deletion and 
<e>-insertion 


I’ve invented these terms to draw attention to three processes which contrast 
with those in the two previous sections. <ie>-replacement is regularly 
remarked upon, and <e>-insertion is a notorious source of confusion for 
some people, but <y>-deletion has hitherto apparently escaped notice. 


<ie>-replacement: There are just five verbs in which the opposite ‘swop’ 
to <y>-replacement occurs, that is, <ie> is changed to <y> before 
<-ing>, namely belying, dying, lying, tying, vying. 


What | have called ‘<y>-deletion’ occurs where abstract nouns ending in 
<-y> correspond to ‘agentive’ nouns ending in <-er, -ist>, e.g. astrolog-y/ 
er, astronom-y/er, biograph-y/er, geograph-y/er, philosoph-y/er, botan-y/ 
ist, chiropody-y/ist, geology-y/ist, misanthrop-y/ist, theor-y/ist, etc. This 
might also cover the loss of <y> in /aity relative to lay (‘not clergy’). 

I’ve invented the term ‘<e>-insertion’ to draw attention to the Oddity 
of some polysyllabic nouns which end in <-o> in the singular adding <e> 
before the plural ending <-s>, e.g. heroes, potatoes, tomatoes. This occurs 
only in nouns ending in <-o> (and in rare occurrences of such nouns being 
used as singular verbs, e.g. The submarine torpedoes the battleship), and 
never with other word-final single vowel letters: bananas, clichés, rabbis, 
menus, not “bananaes, “clichées, “rabbies, “menues. Carney (1994: 174) 
points out that there are a few clear rules here: 


‘The <-oes> form is not found in decidedly Exotic words (generalissimos, 
mulattos) or in words where the plural is unusual (indigos, impetigos) or 
in words with the colloquial ending <-o> (boyos, buckos, dipsos, winos) 
... Lor] if there is a vowel before the final /au/ ... (radios, cameos).’ 


And to the last group one could add patios, rodeos, studios. 
But otherwise one is reduced to listing words which: 

only have <-os>, e.g. concertos, espressos, provisos, quartos, solos 

only have <-oes>, e.g. buboes, heroes, potatoes, tomatoes, torpedoes 

can have either, e.g. cargo(e)s, commando(e)s, halo(e)s, tornado(e)s. 
In my opinion, nothing but a source of confusion (for Dan Quayle, among 
others; this story re-appeared in the Guardian Education Section, 19 August 
2008, p.3) would be lost if all such polysyllabic words were spelt only with 
<-os> (but monosyllables like doe, floe would still have plurals in <-oes> 
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because the <e> is there in the singular; throes would still need its <e> 
despite not having a singular form, goes, noes and the verb does would need 
to be exceptions, and past tenses would still need the <e>, e.g. torpedoed). 


6.7 <-able/-ible > 


These adjective endings are an awkward pair. The origins of the two spellings 
go back to Latin. (If the adjective’s root is descended from a Latin verb of the 
2nd, 3rd or 4th conjugation, the suffix is generally spelt <-ible>; otherwise, 
<-able> - and a fat lot of use that rule is to most people.) Pronunciation 
of the endings is no guide to which adjectives have which ending - both 
are pronounced /abal/, and there are almost no related forms in which 
the stress falls on the relevant syllable and gives the vowel its full value, 
thus removing the uncertainty. (The only exception seems to be syllable - 
syllabic, and here the <-able> word is a noun, not an adjective, and derived 
from Greek, not Latin). 
However, there is a generalisation which is fairly reliable: 


Try saying the adjective without the /abal/. If the result is a free-standing 
word, or ends in /k/ spelt <(c)c>, /g/ spelt <g> or /f/ spelt <ci, ti>, 
spell the ending with <a>. Otherwise, spell it with <i>. 


Examples: biddable, suitable, walkable, amicable, applicable, despicable, 
educable, impeccable, implacable, irrevocable, practicable; navigable; 
appreciable, sociable, insatiable, negotiable; (with retained <e>) traceable, 
manageable, eligible, illegible, intelligible, susceptible, plus the noun 
crucible. 

Exceptions: 

1) Where the root does not sound like a free-standing word but the 
ending has <a>: abominable, admirable, affable, amenable, charitable, 
culpable, demonstrable, disreputable, equable, equitable, execrable, 
flammable, formidable, hospitable, impregnable, incalculable, ineffable, 
inestimable, inevitable, inexorable, inimitable, inscrutable, inseparable, 
interminable, inviolable, irritable, malleable, memorable, miserable, 
palpable, permeable, probable, tolerable, venerable. 

2) Where the root does sound like a free-standing word but the ending 
has <i>: accessible, contemptible, convincible, defensible, discernible, 
flexible, forcible, gullible, reducible, responsible, sensible; also legible 
and the noun mandible even though not derived from ledge, manned. 
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Curiously, the generalisation works for some <-able> words where the root 
sounds like a free-standing word but that word isn’t related to the adjective 
ending in <-able>, for example amiable, capable, liable, syllable, tenable, 
viable, as though derived from Amy, cape, lie, sill, ten, vie. In some cases 
it is necessary to remove a prefix before the test works, e.g. (un)palatable. 


6.8 <-ant/-ent, -ance/-ence, -ancy/-ency > 


There are two useful generalisations for <-ant/-ent>: 

1) The unstressed ending /mant/ is almost always spelt <-ment>. 
Examples (N.B. when these words are nouns; when words of the same 
spelling are verbs or take the adjectival endings /al, ariz/ spelt <-al, 
-ary> the <e> is traditionally pronounced /e/ (though this distinction 
is dying out), and this helps to indicate the <e> spelling): complement, 
compliment, document, element, excrement, experiment, ferment, 
fragment, implement, increment, instrument, supplement. Extension: 
The adjectives (in)clement also have /mant/ spelt <-ment> but have 
no related verb. 

Exceptions: adamant, claimant, clamant, dormant, informant. 

2) The ending /‘esant/ is always spelt <-escent>, e.g. adolescent, 
convalescent. 

Otherwise all these paired endings are if anything even more awkward than 

<-able/ -ible>. Again, the source of the spellings is Latin, and pronunciation 

is of little help - unless there’s a related word in which the stress falls on the 
relevant syllable and the full sound of the vowel removes the uncertainty. 

For example: 


circumstAnce(s) - circumstAntial 
componEnt - componEntial 
confidEnce - confidEntial 
consequEnce - consequEntial 
differEnce, differEnt - differential 
dominAnt - dominAtion 

elemEnt - elemEntal, elemEntary 
elephAnt - elephAntine 


influEnce - influEntial 
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jubilAnt - jubilAtion 

lubricAnt - lubricAtion 

migrAnt - migrAtion 

mutAnt - mutAtion 

presidEncy, presidEnt - presidEntial 
protestAnt - protestAtion 
residEnce, residEnt - residEntial 


substAnce - substAntial 


6.9 Using related forms to spell schwa 


Finding the full vowel in a related word is also the clue to spelling /a/ in 
many other words. In the following examples, the words on the left have 
capitalised vowels spelling schwa whose spelling can be derived from the 
capitalised vowel letters in the words on the right: 


abdOmen - abdOminal 

AcadEmy - AcadEmic 

acAdemic - acAdemy 

adamAnt - adamAntine 

advocAcy, advocAte (noun) - advocAte (verb) 
anAlyse, anAlytic - anAlysis 

Analysis - Analyse, Analytic 

anARchy - anARchic 

artEry - artErial 

associAte (noun, adjective) - associAte (verb) 
articulAcy, articulAte (adjective) - articulAte(d) (verb) 
atOm - atOmic 

Atomic - Atom 

biOlogical - biOlogy 

biolOgy - biolOgical 
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canOn - canOnical 

cAnonical - cAnon 

cAtholicism - cAtholic 

celEbrate - celEbrity 

cElebrity - cElebrate 

cOlloquial, cOlloquium - cOlloquy 
collOquy - collOquial, colloquium 
colUmn - colUmnar 

cOlumnar - cOlumn 

compOnential - compOnent 
cOnfirm - cOnfirmation 

cUstodial - cUstody 

custOdy - custOdial 

definite - finite 

dramA - dramAtic 

drAmatic - drAma 

duplicAte (noun/adjective) - duplicAte (verb) 
essEnce - essEntial 

Essential - Essence 

factOR - factORial 

frequEnt (adjective) - frequEnt (verb) 
grammAR - grammARian 
grAmmarian, grAmmatical - grAmmar 
infinite - finite 

majEsty - majEstic 

mAjestic - mAjesty 

medAl - medAllion 

mEdallion - mEdal 


memOry - memOrial 
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mEmorial - mEmory 

mentAl - mentAlity 

methOd - methOdical 
mEthodical - mEthod 

monARch(y) - monARchical 
mOnarchical - mOnarch(y) 
Obligatory - Obligation 

octAgon - octAgonal 

orAcle - orAcular 

Oracular - Oracle 

palAce - palAtial 

pAlatial - pAlace 

pArabOla - pArabOlic 

parAbolic - parAbola 

patriOt - patriOtic 

perfEct (adjective) - perfEct (verb) 
photOgraph - photO, photOgrapher 
phOtogrApher - phOtogrAph(ic) 
populAR - populARity 

prOcure - prOcurator 

prOfessOR - prOfessORial 
profEssorial - profEssor 
psychiAtry - psychiAtric 

regulAR - regulARity 

separAte (adjective) - separAte (verb), separAtion 
sepUlchre - sepUIchral 

sObriety - sOber 

sOciety - sOcial 


sulphUR(ous) - sulphURic 
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syllAble - syllAbic 

telEgraph, telEgraphic - telEgraphy 
telegrAphy - telegrAph, telegrAphic 
theAtre - theatrical 


variAnt - variAtion 


As the list shows, many pairs provide reciprocal guidance. 


6.10 Elided vowels 


Even more difficult for novice spellers and non-native learners may be words 
where a vowel letter appears in the written version that has no counterpart 
at all (not even schwa) in the spoken version. Five examples in common 
words are: 

secondary /'sekandri:/ with no phoneme corresponding to <a> 

different /‘difrant/ with no phoneme corresponding to <e> 

business /‘biznts/ with no phoneme corresponding to <i> 

category /'keztagri:/ with no phoneme corresponding to <o> 

favourite /‘fetvrit/ with no phoneme corresponding to <ou>. 
Even these few words show significant variability in the vowel grapheme 
that needs to be recovered and written. In this section | enclose many such 
elided vowel letters in round brackets - for this convention see Wijk (1966: 
77-8). Vowel elision is one of four special processes which | have identified 
as operating in English spelling (for the others see section 3.6 and chapter 7), 
and has serious implications for any attempt to deduce the stress patterns 
of words from their written forms (see section A.10 in Appendix A). 

The reason for this syncopation or telescoping phenomenon seems to 
be that English-speakers dislike having to say three syllables containing 
unstressed vowels consecutively, and tend to drop one where this would 
be the case. We even do it sometimes where there would be only two 
consecutive unstressed syllables. And it affects not just single words, but 
also strings of words in running speech, e.g. | should have thought can 
be telescoped into something like /atftaf'Oa:t/, with three syllables rather 
than four. At the extreme this is the process which allowed W.S. Gilbert to 
create outrageous rhymes such as monotony rhyming with got any, that is, 
monot’ny - got ’ny /ma'notni: - 'gotni:/. 
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This trend towards eliding vowels seems to be due to the nature of 
English as a stressed-timed language. It is particularly strong in RP, so 
some non-native learners with experience of a wide range of the accents 
with which English is spoken might be helped by the US pronunciations of 
some words in this category: 

secondary /'sekandeari:/ with /ea/ corresponding to <a> 

category /‘kztagotrit/ with />:/ corresponding to <o>. 
But no helpful vowel phoneme surfaces in mid-Atlantic in different, so some 
such words will continue to pose problems. 

Besides which, a great many native-speaking children learning to spell 
English will receive no such help from their own accents or those of people 
around them, and almost certainly won’t notice the relevant details of 
different accents heard on television, video or DVD or at the cinema. 

The largest category (that word again) of words with an elided vowel is 
those ending in /ris/ spelt <-ry> and with the main stress two syllables 
earlier and /a/ or /1/ in the syllable after the stress: 

with /a/ in that syllable: 

syllab(a)ry, apothec(a)ry; dromed(a)ry, legend(a)ry, second(a)ry, 
customa)ry, concession(a)ry, coron(a)ry, diction(a)ry, discretion(a) 
ry, legion(a)ry, mercen(a)ry, mission(a)ry, ordin(a)ry, precaution(a) 
ry, probation(a)ry, pulmon(a)ry, reaction(a)ry, revolution(a)ry, 
station(a)ry, urin(a)ry, vision(a)ry, advers(a)ry, emiss(a)ry, necess(a) 
ry; comment(a)ry, diet(a)ry, fragment(a)ry, moment(a)ry, necess(a) 
ry, propriet(a)ry, salut(a)ry, secret(a)ry, sedent(a)ry (also pronounced 
with stress on the second syllable and with a schwa corresponding to 
<a>), tribut(a)ry, volunt(a)ry. In extr(a)ordin(a)ry /1k'stro:danri:/ the 
first <a> is also elided 

cemet(e)ry, chancellke)ry, confection(e)ry, dysent(e)ry, jewellery 
(the alternative spelling jewelry avoids the problem), monast(e)ry, 
station(e)ry. In confection(e)ry, jewellery, station(e)ry the fact that 
<-y> is a suffix would help give the correct spelling 

categ(o)ry,; promiss(o)ry; amat(o)ry, conciliat(o)ry, conservat(o)ry, 
contribut(o)ry, declamat(o)ry, defamat(o)ry, desult(o)ry, dilat(o)ry, 
explanat(o)ry, explorat(o)ry, inflammat(o)ry, interrogat(o)ry, invent(o) 
ry (also pronounced with stress on the second syllable and with a 
schwa corresponding to <o>), laborat(o)ry, lavat(o)ry, mandat(o) 
ry, nugat(o)ry, obligat(o)ry, observato)ry, offert(o)ry, orat(o)ry, 
predat(o)ry, preparat(o)ry, promont(o)ry, purgat(o)ry, repert(o)ry, 
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retaliat(o)ry, signat(o)ry, statut(o)ry. Some words that fit this pattern 

in one pronunciation don’t in another (even within RP, let alone the 

differences between RP and GA), e.g. migratory as either /‘matgratri:/ 

(three syllables, stress on first, <o> elided) or /mat'greitari:/ (four 

syllables, stress on second, no elision). 

with /1/ in that syllable: 

lapid(a)ry; vineg(a)ry; culin(a)ry, imagin(a)ry, prelimin(a)ry, budget(a) 

ry, dignit(a)ry, heredit(a)ry, milit(a)ry, monet(a)ry, pituit(a)ry, 

planet(a)ry, sanit(a)ry, solit(a)ry, unit(a)ry,; 

millin(e)ry, presbyt(e)ry. In these words the fact that <-y> is a suffix 

would help give the correct spelling 

alleg(o)ry, audit(o)ry, de/ex/re/sup-posit(o)ry, dormit(o)ry, inhibit(o) 

ry, territ(o)ry, transit(o)ry. 
In many cases, where adjectives in the preceding lists add /li:/ spelt <-ly> 
to form adverbs the tendency to elide the vowel seems to me to be even 
stronger. A few examples would be moment(a)rily, necess(a)rily, statut(o) 
rily, volunt(a)rily, when stressed on the first syllable; those with <a> 
can alternatively be stressed on the <a>, in which case it is a full vowel 
pronounced /e/, e.g. necessarily pronounced /nesa'sertli:/ rather than 
/'nesasraliz/. 

In a very few cases a related word with a surfacing vowel might help: 


seminary - seminArian 
adversary - adversArial 
sanitary - sanitAtion 
dysentery - dysentEric 
presbytery - presbytErian 
allegory - allegOrical 
category - categOrical 
lavatory - lavatOrial 
oratory - oratOrical 


territory - territOrial 


But writers who already know the words in the right-hand column would 
surely already know the correct spellings of the words in the left-hand 
column; and in secretary, secretarial the pronunciation of the adjective 
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might mislead uncertain spellers into writing “secretery, “secreterial (and | 
won’t go into pronunciations such as /'sekateri:/ where the first /r/ is lost). 
Extensions (1), where the ending is still /rizs/ spelt <-ry> but the stress 

pattern is not as predictable: 
A few words where the stress is on the syllable immediately preceding 
the elided vowel: gooseb(e)rry /‘guzbri:/, raspb(e)rry /'ra:zbri:/, 
strawb(e)rry /'stro:briz/), where in normal pronunciation there is no 
schwa vowel after the /b/; annivers(a)ry, compuls(o)ry, element(a)ry, 
evle)ry, fact(o)ry, hist(o)ry, myst(e)ry, nurs(e)ry, vict(o)ry pronounced 
/'evrit, 'feektriz, 'hrstris, 'mistriz, 'viktri:/. Very rapid pronunciations 
of February, diary, library, boundary may also be contracted to 
two syllables /'febriz, 'datri:, 'latbri:, 'baundri:/, but this is usually 
considered too colloquial 
lit(e)rary /'lttraris/, (con)temp(o)rary /(kan)'temprari:/, where the 
spoken ending is /rari:/ and this is spelt <-rary>, so that the elided 
vowel is immediately after the stressed syllable. In temporarily 
pronounced /tempe'rertli:/ there is a schwa in the relevant position 
but it gives no guide to the spelling, and the word may in any case 
be pronounced with three syllables: /'temprali:/; however, if it is 
pronounced /tempa'reartli:/ this would guide the <a> spelling 
vet(e)rin(a)ry, which is usually pronounced (in RP) with only three 
syllables /'vetrinri:/, so that there are two elided vowels, needing to 
be spelt with the second <e> and the <a> (Mr Biggins, the farmer in 
All Creatures Great and Small, reduced it even further, to /'vetanr1/) 
a few nouns ending in <-tuary>: act(u)ary, est(u)ary, mort(u)ary, 
obit(ujary, sanct(u)ary, stat(ujary, volupt(u)ary which are normally 
pronounced with /tfari:/ rather than /tfuari:/). 

Extensions (2), where the ending is no longer /ri:/ spelt <-ry> but the 

consonant after the elided vowel is still /r/: 
Words ending in /rabal, rativ, ratrst/ spelt <-rable, -rative, -ratist>, 
where the preceding vowel always seems to be elided: adm(i)rable, 
comp(a)rable, consid(e)rable, deliverable, faviou)rable, hon(ou)rable, 
inex(o)rable, mis(e)rable, op(e)rable, pref(e)rable, vuln(e)rable; dec(o) 
rative, fig(u)rative, gen(e)rative, op(e)rative; sep(a)ratist. In some 
cases the unsuffixed stem will help: consider, deliver, favour, honour, 
miserly), prefer, décor, figure 
Words where the vowel is sometimes elided, sometimes not, but the 
unsuffixed stem or a related word would normally guide the spelling: 
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advent(u)rous, barb(a)rous (cf. barbarian), conf(e)rence, dang(e)rous, 
def(e)rence, diffie) rence, diffle)rent; ent(e)ring, favou)rite, laund(e)rette 
(the alternative spelling laundrette avoids the problem; in the spelling 
with three <e>’s this word is unusual in having the main stress on the 
syllable after the elided vowel), leve)rage, nat(u)ral, off(e)ring, prefte) 
rence, prosp(e)rous, suffie)rance, temp(e)rament, utt(e)rance 
Two adjective/verb pairs where a vowel is always or almost always 
elided in the adjective but a schwa in the verb may help to indicate 
where a corresponding vowel letter needs to be written: delib(e)rate 
(/dr'lzbrat/ (adjective) with three syllables - contrast deliberate 
/dr'ltbareit/ (verb) with four syllables; sep(a)rate /‘seprat/ (adjective) 
with two syllables - contrast separate /'separeit/ (verb) with three 
syllables, separation, where there is a schwa corresponding to the first 
<a>, though itis no guide to the correct spelling, so that “seperat-e/ion 
are amongst the most frequent misspellings in English 
Words where the vowel is always or almost always elided and stems 
or related words do not help: adm(i)ral, asp(i)rin, avle)rage, cam(e)ra, 
Cath(e)rine, consid(e)rate (if pronounced with three syllables), corp(o)ral, 
corp(o)rate, desp(e)rate, em(e)rald, gen(e)ral, int(e) rest, lib(e) ral, lit(e)racy, 
literal, lit(e)rature, op(e)ra, rest(au)rant, rhinoc(e)ros, seve)ral, 
sovle)reign, temp(e)rament, temp(e)rature; also prim(a)rily /'‘praimrali:/ 
with 3 syllables, where the alternative 4-syllable pronunciation 
/prat'mertli:/ would show that a vowel letter is needed after the <m> 
but the unusual correspondence of /e/ spelt <a> (see section 5.4.2) 
might mislead some writers into spelling the word “primerily 
Words where the vowel might be elided in very rapid pronunciation 
but normal pronunciation would reveal the prefix: hyp(e)ractive, 
hyp(e)rintelligent, int(e)ractive, int(e)ragency 
the Latin phrase et cet(e)ra. 

Extensions (3), where the consonant after the elided vowel is /I/: 
cath(o)lic pronounced /'kz#@ltk/- contrast cathOlicism, chanc(e)llor, 
choc(o)late, om(e)lette; origin(a)lly /a'ridginli:/; p(o)liceman pronounced 
/'plizsman/; fam(i)ly pronounced /'femli:/ 
adverbs with the unstressed ending /1kli:/ which is almost always 
spelt <-ically>, e.g. radic(a)lly. Since all the corresponding adjectives 
end in /1kal/ spelt <-ical> there should be no problem with spelling 
these adverbs, except when the pattern is overgeneralised to the 
few adverbs which are exceptions to it: follicly (challenged) (jocular 
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form derived from follicle), impoliticly, politicly ‘judiciously’, stress 
on first syllable), publicly, not “follically (but this form seems to be 
gaining ground), “impolitically, “publically - but politically (‘pertaining 
to government’, stress on second syllable) does exist, as the adverb 
from political. Extensions: equivoc(a)lly, unequivoc(a)lly 
adjectives ending (in rapid speech) in /tfal/ spelt <-tual>: accent(u)al, 
act(u)al, concept(u)al, contract(ujal, effect(u)al, event(ujal, fact(u)al, 
habitual, intellect(uyal, mut(ujal, perpet(u)al, punct(ujal, rit(u)al, 
spirit(uyal, text(ujal, virt(ujal. These words also have a_ slower 
pronunciation in /tfu:wal/ where the vowel is not elided and guides 
spelling 
adverbs derived from the adjectives just mentioned, e.g. act(u)ally, 
which seem to me to be pronounced with /tfali:/ more with often than 
/t{urwali:/ but the spelling of the adjectives would guide the spelling 
of the adverbs 
some adverbs ending in <-fully> pronounced /fli:/, e.g. beautif(u)lly, 
dutif(u)llyl. 
The next phoneme is also /I/ ina large set of words whose stems end in /al/ 
spelt <-le>, where the schwa is lost when these words have a suffix added 
which adds a syllable, e.g. peddle /'pedal/ v. peddling /‘pedlin/, but these 
words do not add any consonant letter + elided vowel sequences to the 
inventory below because the <e> is deleted before the initial vowel letter of 
the suffix. For much more on this set of words see section 4.4.3. 
Extensions (4), adverbs where the consonant after the elided vowel is /r/ 
and there is an ending /li:/ spelt <-ly>, e.g. advent(u)rously, delib(e) rately, 
irrep(a)rably, pref(e)rably, nat(u)rally. Here again | think that the tendency 
for the vowel to be elided is stronger than in the corresponding adjectives, 
but the spelling of the adjectives would guide the spelling of the adverbs. 
Extensions (5), where the consonant after the elided vowel is /n/: 
ars(e)nal, ars(e)nic, broad(e)ning, bus(i)ness, christ(e)ning, deep(e)- 
ning, defti)nitely, eve)ning, falc(o)ner, fash(io)nable, fresh(e)n-er/ing, 
fright(e)n-er/ing, gard(e)n-er/ing, laud(a)num, list(e)n-er/ing, nati(o)nal, 
nom(i)native, op(e)n-er/ing, pers(o)nal, prelim(i)nary, rati(o)nal, 
reas(o)nable, seas(o)ning, sharp(e)ner, sweet(e)n-er/ing, weak(e)ning, 
wid(e)ning (for opening see also section 10.28) 
twop(e)nny, halfp(e)nny /'tapni:, ‘herpniz/ which no longer exist 
except in the memories of aging Brits like me, but where the ending 
was contracted to /pnirz/. 
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For most of the words in this subcategory stems or related forms do help. 
Extensions (6), a final ragbag: 
caf(e)tiere - contrast café, cafeteria 
comflor)table - contrast comfort 
ecz(e)ma - contrast eczematous 
forec(a)stle pronounced /'fauksal / 
med(i)cine - contrast medicinal 
ramekin pronounced /'remkin/ (also pronounced /'remikin/) 
veg(e) table. 
In this section | have identified 49 consonant letter(s)-plus—elided vowel 
sequences (and there are probably others I’ve not noticed). Just 17 of these 
appear as consonant graphemes in chapter 3: 
<de> spelling /d/ is needed for aide, horde, etc., as well as for 
considerable 
<fe> spelling /f/ is needed for carafe as well as for cafetiere, deference, 
preferably, etc. 
<ffe> spelling /f/ is needed for gaffe, giraffe, pouffe as well as for 
different; etc. 
<ge> spelling /dgj/ is needed for words like image as well as for 
vegetable 
<gi> spelling /d3/ is needed for words like legion 
<ke> spelling /k/ is needed for Berkeley, burke as well as for 
weakening 
<lle> spelling /l/ is needed for bagatelle, vaudeville, etc., as well as 
for chancellery 
<me> spelling /m/ is needed for become, handsome, etc., as well as 
for camera, emerald, omelette, ramekin pronounced /'remkin/ 
<ne> spelling /n/ is needed for heroine, etc., as well as for 
confectionery, stationery, etc. 
<pe> spelling /p/ is needed for cantaloupe, troupe as well as for 
opera, operable, twopenny, etc. 
<se> spelling /z/ is needed for gooseberry as well as miserable 
<si> is needed as the main correspondence for /3/ in vision, etc., as 
well as for /z/ in business 
<(t)te> spelling /t/ are needed for words like granite, route, gavotte, 
roulette, as well as for interest, literal, utterance, etc. 
<the> is needed to spell /6/ in words like soothe as well as /8/ in 
Catherine 
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<ve> spelling /v/ is needed for most words ending /v/ as well as for 
every, etc. 
<ze> is needed as a correspondence for word-final /z/ as well as for 
eczema. 
The other 32 sequences are: <ba, ca, da, ga, ma, na, pa, ra, Sa, ssa, ta, 
tau; be, she; fi, mi, pi, shio, tio; co, for, go, nou, po, so, sso, tho, to, vou, 
xo; fu, tu>. None of these are required by other parts of the analysis, so | 
have not added them to the inventory of graphemes. Even | find there are 
limits to the principle of accounting for every letter under the aegis of some 
phoneme or other (see section A.5 in Appendix A). 
So for some of the words in this section you can rely on related forms; 
for the rest you have no guidance but your visual memory. 
When the consonant letter-plus-elided vowel words are sorted 
alphabetically by the consonant letter, it becomes apparent that far and 


away the most common preceding letter is <t> - see the last paragraph of 
section 9.34. 


7. Special processes 


In this category | include processes which function outside the strict 
range of phoneme-grapheme correspondences but which are essential for 
understanding them. | have identified four: 

/r/-linking, which is dealt with in section 3.6 

elided vowels, which are dealt with just above in section 6.10 

dual-functioning 

surfacing sounds. 
The last two have been referred to frequently in previous chapters and are 
drawn together in the next two sections. 


7.1 Dual-functioning 


| have invented this term to cover cases where, in my opinion, particular 
letters belong to two graphemes simultaneously. In my analysis this process 
affects only the letters <e, r, w, y>. For more background on this see section 
A.8 in Appendix A. 


7.1.1 Letter <e> 


Dual-functioning <e> occurs where the word-final consonant digraphs 
<ce, ge, ve> overlap with split vowel digraphs and the <e> belongs to both. 
For details see variously sections 3.7.4, 3.7.6 and 3.8.4 for <ce, ge, ve>, 
and sections 10.4/17/24/28/38/40 for the split digraphs. Conversely, see 
section 3.7.8 for why | never treat the <e> in <-ze> as part of a split 
digraph. | have also found it unnecessary to treat the <e> in <-se> as part 
of a split digraph. 
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7.1.2 Letter <r> 


The major category of dual-functioning involving <r> is /r/-linking, or 
most of its instances - see Table 3.2 in section 3.6, where | point out which 
examples of /r/-linking do not count as instances of dual-functioning. 
Conversely, there are also cases of <r> having dual functions which are 
internal to stem words and therefore do not arise from /r/-linking. In all 
such cases the phoneme following the /r/ is a vowel: 
Words in which the word-initial morpheme air is followed by a vowel 
phoneme and is spelt <aer>, e.g. aerate, aerial, aerobic, aerodrome, 
aeroplane, aerosol. These all have the word-initial 2-phoneme sequence 
/ear/ in which /ea/ is spelt <aer> and the <r> also spells /r/ 
Many cases of medial /ear/ spelt <ar> in which the <r> functions 
as part of <ar> spelling /ea/ and also spells /r/, e.g. area, garish, 
gregarious, parent 
Two cases of medial /ear/ spelt <er> in which the <r> functions 
as part of <er> spelling /ea/ and also spells /r/, namely bolero 
(/ba‘learau/ ‘dance’), sombrero 
Two cases of word-initial /tar/ in which the <r> functions as part of 
<eer, eyr> spelling /1a/ and also spells /r/, namely eerie, eyrie 
Words in which medial /1ar/ is spelt <er> and the <r> functions as 
part of <er> spelling /1a/ and also spells /r/, e.g. adherent, cereal, 
coherence, coherent, ethereal, funereal, hero, inherent, managerial, 
material, perseverance, serial, series, serious, serum,  sidereal, 
venereal, zero 
Words in which medial /atar/ is spelt <ir> and the <r> functions as 
part of <ir> spelling /ata/ and also spells /r/, e.g. biro, giro, pirate, 
virus 
One word (and derivatives) in which initial /juar/ is spelt <eur> and 
the <r> functions as part of <eur> spelling /jua/ and also spells /r/, 
namely Europe 
urea and many words derived from it in which initial /juar/ is spelt 
<ur> and the <r> functions as part of <ur> spelling /jua/ and also 
spells /r/ 
Words in which medial /(j)uar/ are spelt <ur> and the <r> functions 
as part of <ur> spelling /(j)ua/ and also spells /r/, e.g. (with /uar/) 
during pronounced /'dgjuarin/, juror, jury, rural; (with /juar/) curate 
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(pronounced both /'kjuarat/ ‘junior cleric’ and /kjusa'rert/ ‘mount an 
exhibition’), curious, during pronounced /'djuarin/, fury, spurious. 
For longer lists see section 5.6.5 

One word in which medial /uar/ is spelt <our> and the <r> functions 
as part of that grapheme spelling /ua/ and also spells /r/, namely 
houri. Contrast potpourri, in which, uniquely, /ua, r/ can be analysed 
(admittedly counter-intuitively, but there is no call for a grapheme 
<ourr>) as Spelt separately as <our, r> 

Words in which initial or medial /a:r/ is spelt <or> and the <r> 
functions as part of <or> spelling /3:/ and also Spells /r/, e.g. aurora, 
authorial, borax, chlorine, choral, chorus, corporeal (only the second 
<or> since the first is followed by a consonant), decorum, dictatorial, 
editorial, euphoria, flora(), forum, glory, memorial, oracy, oral, 
oration, oratorio (only the medial occurrence - in my accent the initial 
phoneme is /p/, not /3:/), orient (noun, pronounced /'drritjant/ - the 
verb of the same spelling is pronounced /obri:'jent/), quorum, variorum. 


7.1.3 Letter <w> 


There are very few instances of dual-functioning <w>, and within stem 
words they are all medial and followed by a vowel phoneme: 
In ewer, jewel, newel, skewer, steward, <w> is both a single-letter 
grapheme spelling /w/ and part of the digraph <ew> spelling /(j)u:/ 
In bowie, rowan (in its English pronunciation /'‘rauwan/), <w> is both 
a single-letter grapheme spelling /w/ and part of the digraph <ow> 
spelling /au/ 
In bowel, dowel, rowel, towel, trowel, vowel, bower, cower, dower, 
flower, glower, power, shower, tower, coward, dowager, howitzer, 
prowess, plus the Scottish pronunciation of rowan /'rauwan/, <w> is 
both a single-letter grapheme spelling /w/ and part of the digraph 
<ow> spelling /au/. 
Other instances occur when words ending in <w>, which here is always part 
of a digraph spelling /(j)u:, au, au/, have a suffix beginning with a vowel 
phoneme added; also in running speech when such words are followed by 
a word beginning with a vowel phoneme. For examples and discussion of 
‘linking /w/’ see Table 3.7 in section 3.8.7 and the paragraphs preceding 
and following it. 
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7.1.4 Letter <y> 


Like <w>, there are very few instances of dual-functioning <y>, and within 
stem words they are all medial and followed by a vowel phoneme: 

In abeyance, /j/ is spelt <y> but the <y> is also part of <ey> 

spelling /er/ 

In arroyo, doyenne pronounced /do1'jen/, foyer pronounced 

/‘foijer, 'fotja/, loyal /‘lotjal/, Oyer (and Terminer), royal /'rd1jal/, soya, 

/j/ is spelt <y> but the <y> is also part of <oy> spelling /21/ 

In coyote /kar'jauti:/, doyenand doyenne pronounced /dwat'jen/, foyer 

pronounced /'fwatje1/, kayak /'katjek/, papaya, voyeur /vwat'j3:/, 

/j/ is spelt <y> but the <y> is also part of <ay, oy> spelling /at/. 
Other instances occur when words ending in <y> forming part of a digraph 
spelling /e1, 31, ir:/ have a suffix beginning with a vowel phoneme added; 
also in running speech when such words are followed by a word beginning 
with a vowel phoneme. See Table 3.8 in section 3.8.8 for examples and the 
paragraphs before and after it for discussion of ‘linking /j/’, including why 
| do not count <y> as a single-letter grapheme as having two functions in 
these circumstances. 


7.2 Surfacing sounds 


This is my term for phonemes which are absent in a stem word but present 
in one or more of its derived or associated forms. | have borrowed the term 
‘surfacing’ from transformational-generative grammar of yesteryear (and 
probably misapplied it). The great majority of the examples involve letters 
in stem-final position or immediately before that which are ‘silent’ (as 
conventional terminology has it) in the stem but pronounced when the word 
is suffixed; but there are also a very few initial examples - there are some 
amongst related forms of words with elided vowels in section 6.10. Most 
examples involve consonants, but there are a few involving vowels in final 
position. Some cases require detailed etymological knowledge. Actually, 
linking /r, w, j/ could also count here, but | have already dealt with them. 


7.2.1 Sounds which surface in stem-initial position 


In a few words with initial /n/ spelt <gn> the /g/ surfaces when the stem 
is prefixed: compare Gnostic, gnosis with agnostic, diagnosis, prognosis 
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In a couple of words with initial /n/ spelt <mn> the /m/ surfaces 
when the stem is prefixed: compare mnemonic, mnemonist with 
amnesia, amnesty (the etymological connection here is that all these 
words derive from the Greek word for ‘memory’) 

In one word with initial /s/ spelt <ps> the /p/ (and an etymologically 
related /m/) surface when the stem is prefixed: compare psychosis 
with metempsychosis 

Two of the few words with initial /t/ spelt <pt> are pterodactyl, 
pterosaur. The /p/ surfaces in archaeopteryx, helicopter 

In just one word where intial /n/ is spelt <kn> the <k>, assisted by an 
inserted <c>, surfaces as /k/: compare knowledge with acknowledge. 


7.2.2 Sounds which surface in medial position 


Initial /t/ is spelt <tw> only in two and derivatives, e.g. twopence, 
twopenny, and the /w/ surfaces in between, betwixt, twain, twelfth, 
twelve, twenty, twice, twilight, twilit, twin. In this case it would probably 
be more accurate to speak of the /w/ in two being ‘submerged’ since 
it is present in all those other words and would have been pronounced 
in (much) older forms of English, as <w> still is pronounced /f/ in the 
related German words zwei, zwo, zwanzig 

There are three words in which <t> forms part of <st> spelling 
medial /s/ in the stem but /t/ surfaces in derivatives: compare apostle, 
castle, epistle with apostolic, castellan, castellated, epistolary. For 
the converse of this, i.e. words in which stem-final /st/ spelt <-st> 
becomes /s/ after suffixation, see /s/, section 3.6.6 

There are two words in which <c> forms part of <sc> spelling medial 
/s/ in the stem but /k/ surfaces in derivatives: compare corpuscle, 
muscle with corpuscular, muscular 

In a few words with medial or final /t/ spelt <bt> /b/ surfaces in 
related forms: compare debt, doubt, subtle with debit, indubitable, 
subtility 

In one word with medial /k/ spelt <cu> the <u> surfaces as /ju:/ in 
a derived form: compare circuit with circuitous 

In one word with medial /g/ spelt <gu> the <u> surfaces as /w/ in 
two related words: compare languor with languid, languish 

In one word with medial /k/ spelt <qu> the <u> surfaces as /w/ ina 
derived form: compare conquer with conquest 
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Dicer 


In one word with final /t/ spelt <ct> /k/ surfaces in a derived form: 
compare indict with indiction (with change of vowel phoneme) - but 
not before inflectional suffixes in indicts, indicting, indicted 

In a few words with final /n/ spelt <gn> /g/ surfaces in derived 
or related forms: compare impugn, malign, sign with pugnacious, 
repugnant, malignant, assignation, designation, resignation, signal, 
signature (all with change of vowel phoneme) - but /g/ does not surface 
before inflectional suffixes, as in impugns, impugning, impugned, 
maligns, maligning, maligned, signs, signing, signed 

In three words with final /m/ spelt <gm> /g/ surfaces in derived or 
related forms: compare paradigm, phlegm, syntagmwith paradigmatic 
(with change of vowel phoneme), phlegmatic, syntagma(tic) - but /g/ 
does not surface in paradigms, phlegmy 

There is only one word with final /t/ spelt <pt>, namely receipt, and 
/p/ surfaces in reception, receptive (with change of vowel phoneme) 
In adverbs ending <-edly> derived from past participles ending in 
<-ed> pronounced /d, t/, <ed> is nevertheless pronounced /1d/, 
so the <e> has surfaced as /1/, e.g. determinedly, markedly. This 
also applies in a few nouns derived from such past participles, e.g. 
preparedness, and in a number of adjectives which are derived from 
or resemble past participles but have /1d/ rather than the expected 
/d, t/, but often with a different meaning, e.g. aged (/‘e1d31d/ ‘elderly’ 
vs /etdgd/ ‘having ... years’), dogged (/'dogid/ ‘persistent’ vs /dogd/ 
‘pursued’). For many more examples see sections 5.4.3 and 10.15 

In inherit the /h, r/ of heir both surface - or is this taking things too far? 


3 Sounds which surface in stem-final position 


In acreage, ochreous, ogreish, relative to the unsuffixed forms /r/ has 
surfaced in stem-final position, but /r/ and the preceding schwa seem 
to be represented by <r, e> in reverse order 

In actress, ambassadress, ancestress, conductress, dominatrix, 
executrix, foundress, laundress, ogress, protrectress, temptress, 
tigress, wardress (Supposing any of these forms except tigress 
are still PC; if you want to see how many other ‘feminine’ forms in 
<-ess, -ix> are now disused and deservedly forgotten, take a look 
in Walker’s Rhyming Dictionary), relative to actor, ..., warder, the 
schwas (and <e, o> which helped to spell them) have disappeared 
and /r/ has surfaced before the suffix 
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In the one word falsetto, supposing the connection with false is clear, 
it could be considered that the <e> surfaces as /e/ 

In accoutrement the final schwa of accoutre disappears and two 
phonemes surface: /r/ spelt <r> and /1/ represented by the first <e>. 
In several words with final /m/ spelt <mn> /n/ surfaces before 
derivational suffixes: compare autumn, column, condemn, damn, 
hymn, solemn with autumnal, columnar, columnist, condemnation, 
damnable, damnation, hymnal, solemnity - but not before inflectional 
suffixes or adverbial <-ly>, e.g. columns, condemned, damning, 
solemnly 

In a few words with final /m/ spelt <mb> /b/ surfaces in derived 
or related forms: compare dithyramb, bomb, rhomb, crumb with 
dithyrambic, bombard(ier), bombastic, rhomb-ic/us, crumble and 
supposedly, according to some authorities, thumb with thimble - but 
not before inflectional suffixes, e.g. bombs, bombing, crumbs 
Although long, strong, young end in /n/ (in RP) and are therefore 
to be analysed as containing /n/ spelt <ng>, the comparative and 
superlative forms longer, longest, stronger, strongest, younger, 
youngest and the verb elongate all have medial /ng/, so here /g/ 
has surfaced and is represented by the <g>, and /n/ is spelt <n>; 
similarly with diphthong, prolong when suffixed to diphthongise, 
prolongation - but /g/ does not surface before inflectional suffixes 
or adverbial <-ly>, e.g. longing, strongly 

But in longevity the surfacing phoneme is /d3/ 

There are several French loanwords in which final vowel phonemes 
are spelt with graphemes containing final <t> and /t/ surfaces when 
the stem is suffixed: compare ballet, debut, parquet, rapport, sabot, 
valet (also pronounced with /1t/) with balletic, debutante, parquetry, 
rapporteur, sabotage, saboteur, valeting. |In balletic, parquetry, 
sabotage, saboteur and (if the pre-suffixation ending is /er/) valeting, 
the vowel phoneme also changes 

There is one French loanword in which final /wa:/ is spelt <-ois> and 
the <s> surfaces as /z/ when the stem is suffixed: compare bourgeois 
with bourgeoisie. 


8. The graphemes of 
written English 


8.1 Choosing a written variety to analyse 


To match my decision to analyse the RP accent, | have chosen British rather 
than US spelling as the written variety of English to analyse. In practice, this 
makes very little difference, since there is far less variation in the spelling 
of English than in its pronunciation. The differences between British and 
US spelling make almost no difference to the analysis of the graphemes of 
written English - the same graphemes are used in both systems, just with 
different correspondences. 


8.2 How many graphemes, and how many 
correspondences? 


More troublesome than the minor differences between British and US 
spelling are the wide differences in opinion between experts on how many 
graphemes there are in written English. Wijk (1966: 14) says that the 
‘sounds of the spoken [English] language are normally represented by 102 
symbols in the written language’, but a great many Oddities are concealed 
behind that ‘normally’; also he does not count 15 doubled consonants, 
e.g. <bb, dd>. 

At the other extreme, Mountford (1998: 109) says he will ‘work with 
a combined set of some 235 consonant and vowel symbols’, which on 
inspection of his tables on p.113 turns out to be more precisely 238 
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graphemes, which are involved in 407 correspondences. In his figure for 
graphemes Mountford includes many quite rare graphemes not counted 
by Wijk, but even Mountford admits that there are others which might be 
counted but which are so rare and marginal that they are not worth the 
bother. His example (p.112) is the possible grapheme <schsch> spelling 
the phoneme /J/ only in the rare word Eschscholtzia (the California poppy). 
Actually, both Wijk and Mountford are right, at different levels of 
analysis - the number of graphemes you recognise depends on how deep 
you go into the Oddities of the system (and on various technical decisions 
- I’ve summarised mine in Appendix A). 
| am going to provide three estimates of the number of graphemes by 
counting: 
1) all and only those which appear in what I’ve called the ‘main system’ 
in chapters 3 and 5; 
2) the rest, including the minor patterns and Oddities; 
3) both. 
All the graphemes which appear in both the main system and the rest 
in chapters 3 and 5 are listed in Tables 8.1-2, which cover graphemes 
representing consonants and vowels respectively. Both contain relevant 
2- and 3-phoneme graphemes; each of these appears more than once, 
either within the same Table or across the two. In both, the totals for 
correspondences show exactly how many entries there are in the relevant 
column, but those for graphemes show only the numbers of items which 
have not already appeared in the same column or a previous one. So in the 
‘Basic grapheme’ columns, <th> is counted only once among consonant 
graphemes, and <a, 0, 00, u> only once each among vowel graphemes. 
After those columns, for graphemes I’ve shown only the numbers of new 
items (indicated by + signs), with some subtotals. 


TABLE 8.1: ALL THE CONSONANT GRAPHEMES OF WRITTEN ENGLISH, 
BY RP PHONEME. 


The Table includes not only graphemes for single consonant phonemes, 
but also those for 2- and 3-phoneme sequences involving consonant 
phonemes. The consonant phonemes are listed in the same order as in 
chapter 3. 

(For simplicity, almost all angled brackets indicating graphemes are 
omitted). 
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= > -| ydd yb y a A os} i = yd 4 /4/ 
- -| yos} ays | 1 Yy} 9} Z9 19 99 ) - yr} - 1 ip) /f/ 
IX X /Pi/ 
anb 
Wer /S>/ nb> nb >>| yy a x X yeds /s>/ 

x /s9/ - yo> | yb nd bd yd 99 6 - 2 b oT fs) // 
- - - ysl 4JM Ud 4 - - dd - - d /4/ 

ZZZ /s1/ ma uy 
yi /ei/ | yaud -| a1adappaq - an Hn - pa 1 /y/ 
- - - - ud ad yb dq q odd dd - - d /d/ 

ub /fu/ aub | ud mu qu 6u au 
u /ue/ - oup pu uw u> ub - ouu uu - - u /u/ 

pu 
wi /we/ -) aqui | uw ow qu wb - OWL WLW - - wi /w/ 

x /<6/ 
UX X /z6/ | nby> anb nb ub - - 66 - - 6 /6/ 
= = -}| Upp YP ep pq = 7 pp po p /p/ 
cs ae - = qd nq yq - = qq TT = q /q/ 
sewayder5 s2uenpos euleMoud y 2 é t <a> + Buyjads Buijads sowaydesb saweydeib | awaydeib 
soawaydesb awauoyd-¢ B -Z $191}3] JO Jaquinu Aq ‘sanippo pejqnoqg pajqnoq asey | juanbay 42410 diseg awau0ud 
1s94 YUL wajsAs ulew ay 
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IDs ys ss fs 
IX X /Pr/ -| syd eyd | ads yd] 159 e/u e/u Ce) Iss 1s 1D ys /f/ 
- -| anbu | nbu ybu pu du - e/u e/u - u bu /u/ 
- - - - ym e/u e/u - - y /y/ 
s /z1/ az $} 
Ux X /z6/ - -| sS9S ZD x - ZZ - ass Z /z/ 
AA 
- - - - ud aq - - dA - 4} A /A/ 
YX ox 
x yas MS 1s x 
ZZZ yds aos | 9s sd 9D Z1 ass ss yads /sy/ as 909 s /s/ 
Il 
| a -| 4p ay [6 7 all ll -| ap yeds /je/ /\/ 
([ 16 66 
= - az -| [Pp Ip yp p -| 26p 6p - a6 6 f /$p/ 
a2uanbas 
syeuesS ee is 2 z l <a> + Buryjeds Buljjeds soweydesb sawaydesib awaydeib 
sawaydeib awauoyd-¢ B -Z $491}9] JO Jaquinu Aq ‘sanippoO pejqnoqg pejqnoq arey | juanba.y 42410 diseg awau0ud 
1sa4 SUL wajsAs ulew ays 
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O8L= eZ= Soauep 
9E S eZ 76 81 Z Zl 9 92 vz | -uodsaiiod 
80L= 8S9= 
61+ b+ 61+ 6S4 Ot Z Zi+ z+ 91+ €zZ | sawaydes6 
ub 
ll 
ain enn na /ef/ 
ain an ina /eal/ an ‘n ‘ma 
nn 4nin an yads /inf/ 
ama na nea /xnf/ - - ll fy e/u e/u | yeds /rinf/ | A /f/ 
410 /erem/ 
Ao /TemM/ 
slo 
2410 410 10 /:0M/ rn 
{e) /VM/ - - no ny - e/u e/u - ymMn M /M/ 
- - - ayi - 7 e/u e/u 7 7 yi /Q/ 
ui /ev | uaud oul = = e/u e/u = = yi /0/ 
IZ 
x /£6/ - - asd | zfb e/u e/u sob - Is /&/ 
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TABLE 8.2: ALL THE VOWEL GRAPHEMES OF WRITTEN ENGLISH, BY RP 
PHONEME PLUS /ju:/. 


The Table includes not only graphemes for pure vowel phonemes and 
diphthongs, but also those for 2- and 3-phoneme sequences involving vowel 
phonemes. The vowel phonemes are listed in the same order as in chapter 5 
and, as there, the special 2-phoneme sequence /ju:/ is included in the main 
list. (For simplicity, angled brackets indicating graphemes are omitted). 
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ysse dea dane ye 
SIO 9410 410 10 /:0M/ ase dze ase see | se je yeare ee - e Je /x0/ 
410 | /erem ‘eIc/ 
dno snoy /ene/ 
adA JA dul JI /ere/ 
ain ennna /ef/ 
r jue/ 4A an 
aun en a1 MO NO 
m4 due / 4no 4Je0 Ana JO 10 O| eI na 
| /\e/ ybno ned aja Due Oa Ja se re ye Ant | a, yeds /je/ O439 e /e/ 
- - [no no JO fe) - n 00 /a/ 
{e) /VM/ no 00 30 - fe) n /v/ 
- - nea ye MO NO oY ne i) - e {o) /a/ 
s /z1/ eay | IMalellaavvie | noe - Aa | /1/ 
a1 09 
x /s2/ la ea Ae te ae ne - - 3 /2a/ 
- - jaye re oe | - - e /z/ 
aouenbes 
saweydeiy | ewauoud v € 4 L sowaydesb sowaydeib | awaydeib 
sowaydesb awauoyd-¢  -Z sia}9| JO Aaquinu Aq ‘sanippo aiey | JUaNbasy 4210 dIseg aweUu0ud 


ySo1 OY 


waisAs ulew oy 
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aun 
ain in ina /eal/ 4Jno 400 Ana dn - - - /ea/ 
42! 
- - 24,3 JAd dla JI - a49 Jd 190 Jeo /er/ 
Alay a4,Aa 
adda JOAe aa da 
- - dake are | 13,9 yea sae Ja oe - dea se use aie /ea/ 
4no snoy /emae/ ybno Moe ne - MO no /ae/ 
410 /eic/ Me - Ao 10 /Ic/ 
4no 
Uo 140 duo 
a4,n0 yBno | JOO 4130 1eO 
- -| sdio yBne | ame ine jne eo |e -| some ne see JO [xc/ 
dun ino 
Ojo wu une 
- - yssA 449 349 1ed JA na - dn JO JI Ja /x€/ 
auenbas sowaydes6 
seusydes: | euruoyd v € 4 L soweydesb quanbai | swaydesb 
sowaydeib awauoyd-¢ B -Z $19}}9] JO Jaquinu Aq ‘sanippo aey 42410 dIseg awWaUu0ud 


ysa4 UL 


woajsAs ulew oy 
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9rz= saduap 
92 02 OZ 9OLL vl 89= S Ev 02 | -Yodseai10> 
60l= LLt+ SL+ 6+ vet 0} ZE= vt Zi+ 91 | sawaydesb 
xno 3no sno 
dno ano nn inno 
- - yBno | yoo nao nal | 3°0 30 na 9a an an noma 00 /in/ 
- - ama nea nn qjninno an an Ma n /xnf/ 
no jo so 
aMO 3JO | OO JO YO 30 
- - yBno | yeo nea yore | eo Ma O9 ne - MO 3'O fe) /ne/ 
410 /erem/ 
aA ah 
ae (AEM) In Ao st 41 3! 
dA JA ddl dl /ere/ yBia | ada ake sie | et Aad ja le ae e - A ub! al /te/ 
30 USI a1 Ad 
- - SIO 09 Ja Ae oe al o'9 Aleao 30 /u/ 
za Ao 
ya sada le 
ake d°d 9d A 
- -| ypbla ybre jye we sre | ne oe ye oe r) - Ae tee ave /19/ 


262 Dictionary of the British English Spelling System 


Discounting duplicates (including those involving 2- and 3-phoneme 
graphemes), in Table 8.1 there are 58 graphemes and 73 correspondences 
in the main system, and 108 graphemes and 180 correspondences in the 
rest, making a total of 166 graphemes and 253 correspondences in which 
consonant phonemes are involved. 

On the same basis, in Table 8.2 there are 37 graphemes and 68 
correspondences in the main system, and 109 graphemes and 246 
correspondences in the rest, making a total of 146 graphemes and 314 
correspondences in which vowel phonemes are involved. 

However, adding together the numbers in the two preceding paragraphs 
does not yield correct overall totals because several graphemes and 
correspondences appear in both Tables. Thus <i, u, y> occur as both 
consonant and vowel graphemes, and some 2-phoneme sequences and 
both 3-phoneme sequences represented by single graphemes contain both 
consonants and vowels. De-duplicating these complications reduces the 
number of graphemes by 28 (6 in the main system, 22 in the rest) and the 
number of correspondences by 24 (3 in the main system, 21 in the rest). 

The full analysis therefore yields totals of: 

89 graphemes and 138 correspondences in the main system 

195 graphemes and 405 correspondences in the rest, and 

284 graphemes and 543 correspondences overall. 
Thus my analysis has led to distinctly higher totals even than Mountford’s 
238 graphemes and 407 correspondences. This is mainly because | have 
included a lot of correspondences found only in small numbers of more 
recent French loanwords which he did not include. 


8.3 The graphemes of the main system and 
the rest 


Alphabetical lists of the 89 graphemes of the main system and of the 195 
others are provided in Tables 8.3 and 8.4 respectively. Theoretically it 
should be possible to spell any English word using just the 89 graphemes 
of the main system and their 138 main-system correspondences, since 
they cover all 44 phonemes and allow for different positions in the word 
and various other constraints. However, from my analysis and every other 
author’s it is abundantly clear that the full system is much more complex 
- and, to give just one example, trying to spell schwa consistently as <er> 
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in stem-final position and <a> elsewhere would probably produce many 


confusing spellings. 


Table 8.3 shows that there are, of course, 26 single-letter graphemes 
in English spelling; they all belong to the main system. The numbers of 


graphemes of all sizes in the main system and the rest are: 


main system the rest total 
single letters 26 0 26 
digraphs 53 118 171 
trigraphs 10 57 67 
four-letter graphemes 0 20 20 
total 89 195 284 


Simplified versions of the tables of correspondences are provided in Appendix 


B: they are intended to be much more useful to teachers and to writers of 


early reading books than the comprehensive versions in Tables 8.1-2. 
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TABLE 8.3: ALPHABETICAL LIST OF THE 89 GRAPHEMES OF THE MAIN SYSTEM. 


a a.e ai air ar | are | au aw | ay 
b | bb 
ce ch ci ck 


c 
d | dd dg | dge 


e |/ea ear | ed ee |e.e | eer | er |ere | ew 
f | ff 

g | ge gg 

h 

i ie i.e igh ir 

j 

k 

| le II 

m | mm 

n | ng nn 

Oo | Oe oi oo or | ore | ou | ow | oy 
p | ph | pp 

q 

r rr 

S se sh si ss | ssi 
t tch th ti tt 

u_ | ue u.e | ur 

v | ve 

w | wh 

x 

y 
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TABLE 8.4: ALPHABETICAL LIST OF THE OTHER 195 GRAPHEMES. 

aa aar ach | ae aer ah aigh | aire | ais | ait | al alf 
anc | ao aoh aow | arr arre | arrh | as at augh | aul 
aur | awe | aye ayer | ayor 

bh | bd bp bt bu bv 

cc cch | che | chs ckgu | cq cqu | ct cu CZ 

de | ddh | dh ddh | di dj dne | dt 

eah | eau | e’er | ei eigh | eir eo e’re | err | erre | es et 
eu eur | ewe | ey eye eyr | ey’re | ez 

fe ffe ft 

gh | gi gl gm gn gne gu gue 

hea | heir | ho hour | hu 

ia ier ieu io ire irr is it 

jj 

ke kh kk kn 

lh lle 

mb | mbe | me mme | mn 

nc nd ne ngh ngu | ngue | nne | nt nw 

oa | oar | oat oe oer oeu oh oir oire | ois | ol olo 
ooh | oor | orp orps | orr ort | os ot oue | ough} oul 
oup | our | ou’re | ous’ | out oux | owe 

pb | pe phth | pn ppe |pph | ps | pt 

qu | que 

re rh rrh 

SC sce sch sci sj sse st sth sw 

te the | ts tsch | tte tw 

ua ui ure urr ut uu 

w 

wi wr ww 

xe xh xi 

ye y.e yr yre yrrh 

ze zi 


9. The grapheme-phoneme 
correspondences of English, 
1: Graphemes beginning 
with consonant letters 


A reminder: in chapters 9 and 10, the meanings of ‘initial’, ‘medial’ and 
‘final’ referring to positions in words are different from their meanings 
in chapters 3-7, which deal with the phoneme-grapheme direction: here 
they refer to positions in written words, since these chapters deal with the 
grapheme-phoneme direction. So, for instance, here the ‘magic <e>’ in 
split digraphs is described as being in word-final position, and consonant 
letters enclosed within split digraphs are in medial position. 


9.0 Unwritten consonant phonemes 


This is the appropriate place to recall that some occurrences of medial and 
linking /w/ (and possibly two of initial /w/) and a great many occurrences 
of medial and linking /j/ are not represented in the spelling at all - see 
sections 3.8.7-8. Following linguistic convention, these instances can be 
described as spelt by zero, hence the numbering of this section. Necessarily, 
as far as these cases are concerned, the rest is silence. 


9.1 General introduction to the grapheme- 
phoneme correspondences 


In chapters 9 and 101 present the grapheme-phoneme correspondences 
from British English spelling to RP using the inventory of 284 graphemes 
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listed in chapter 8. This chapter covers graphemes beginning with 
consonant letters, chapter 10 those beginning with vowel letters; for this 
purpose <y> counts as a vowel letter. This arrangement is followed even 
with graphemes which begin with a letter of one category but always or 
mostly have correspondences with phonemes of the other category, e.g. 
<ed> in past verb forms, <le> in table, etc., and those which have both 
consonant and vowel pronunciations, especially <i, u, y>. 

The distinction between the main system and the rest which was arrived 
at in chapters 3 and 5 is maintained here, in mirror-image. That is, the 
only graphemes which are treated as part of the main system are the 89 
listed in Table 8.3, and the only grapheme-phoneme correspondences 
which are treated as part of the main system are the converses of the 138 
phoneme-grapheme correspondences involving those graphemes in the 
‘main system’ columns of Tables 8.1-2; this principle is maintained even for 
correspondences whose frequencies in this direction are low. 

Other grapheme-phoneme correspondences which involve the 89 
main-system graphemes are treated as exceptions to the main system 
(even where their correspondences in this direction have high frequencies). 
Because some correspondences which are frequent in the phoneme- 
grapheme direction are rare in the grapheme-phoneme direction, and vice 
versa (which indicates both a mismatch between the two directions and 
therefore a basic misdesign in the overall system), in chapters 9 and 10 | 
have abandoned the distinction between frequent and rare correspondences 
for the main-system graphemes. However, most minor correspondences 
are again treated as Oddities. (For exceptions to the last statement, see 
sections 9.5 and 10.2). 

Across chapters 9 and 10 all 89 graphemes of the main system are 
covered. However, there are only 76 entries. This is mainly because of the 
12 principal doubled consonant spellings which consist of two occurrences 
of the single letter which spells the same phoneme: <bb, dd, ff, gg, II, mm, 
nn, pp, rr, ss, tt, zz> (the 13" is because <dg, dge> share an entry). Since 
these ‘geminates’ have hardly any pronunciations different from the basic 
one of the corresponding single letter, and their two letters hardly ever 
belong to separate graphemes (for the only exception | know of, see Notes 
to section 9.15), each is covered within a joint entry with the single letter. 

Otherwise, in both chapters the graphemes of the main system are listed 
in alphabetical order, with cross-references to show where those consisting 
of more than one letter are not covered under their initial letter. Minor 
graphemes are listed under the appropriate main-system grapheme, e.g. 
<bd> under <b>, <ae> under <a>, <err> under <er>, etc. 
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[For compulsive counters: 2- and 3-phoneme graphemes are treated 
differently in these two chapters from chapters 3 and 5. There, each such 
grapheme was logged under each of the relevant phonemes. Here, each 
such grapheme is logged only once, under its initial letter. But the total 
number of correspondences remains the same.] 


9.2 When is a digraph not a digraph? 


(Parallel questions apply to trigraphs and four-letter graphemes - see for 
example the competing possibilities for word-final <che> (section 9.9), the 
discussions of <gh> in the entry for <g, gg> (section 9.15), <ough> in the 
entry for <ou> (section 10.33), the discussion of vowel letters ‘in hiatus’ 
(section 10.42), and the paragraph beginning ‘When is a split digraph not a 
split digraph?’ in Appendix A, section A.6). 

Some sequences of more than one letter which form main-system 
graphemes never or hardly ever occur except as those graphemes - a clear 
example is <ck>. Others have exceptions only at morpheme boundaries 
within words, e.g. <t, h> in a few words like carthorse, meathook, <o, o> 
in cooperate, zoology. Other main-system graphemes again occur only in 
restricted positions, so that all other occurrences of the same sequence of 
letters contain more than one grapheme - see for example <ce> (section 9.8). 
| attempt to give clear guidance related to each main-system grapheme (and 
in section 9.44 state a generalisation about the six graphemes other than 
<sh> which are pronounced /Jf/), but in the end effectively have to assume 
that a human reader (as distinct from a computerised text-to-speech 
system) can recognise both morpheme boundaries within compound words 
and multi-letter graphemes within stem words. Carney (1994: 286-7) states 
the same assumption. 

| also assume that readers of this book will realise that other occurrences 
of the sequences of letters which constitute minor multi-letter graphemes 
follow the general rules; therefore | do not waste space saying (for example) 
‘Occurrences of <p, s> other than word-initially consist of separate 
graphemes’. Conversely, where a correspondence for a single letter is said 
to be ‘regular’, this does not include cases where the letter forms part of 
another grapheme; for example, ‘<c> is pronounced /s/ before <e, i, y>’ 
does not include its di/trigraphic occurrences in <ce, ci, sci>. 

Both assumptions work better for graphemes beginning with consonant 
letters than for those beginning with vowel letters - but that is true of 
generalisations for the two sets of graphemes as a whole. 
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9.3 Frequencies 


The frequencies in these chapters are derived from Gontijo et a/. (2003). They 
used a corpus of 17.9 million words (the CELEX database, Version 2.5, Baayen et 
al., 1995) in which both the British spelling and (a computerised version of) the RP 
pronunciation of every one of the 160,595 different words is represented. (The 
authors do, however, point out that 2,887 lines (1.8%) in the database contain 
multi-word expressions, of which the longest is European Economic Community, 
hence the number of lines with unique single words is actually 157,708.) Gontijo 
et al. based their graphemic analysis on that of Berndt et al. (1987), which was 
based on a corpus of only 17,000 words in US spelling, but adapted it for British 
spelling and expanded it to deal with rarer graphemes as their analysis proceeded. 
Ultimately, Gontijo et a/.’s database contained a set of 195 graphemes and 461 
grapheme-phoneme correspondences. While these numbers are rather smaller 
than my overall totals of 284 graphemes and 543 correspondences (see section 
8.2), most of the ‘missing’ graphemes and correspondences are rare and would 
only be found by a total spelling nerd (= me). 

As will be apparent, Gontijo et a/. used a different corpus from Carney. Also, 
unlike Carney, they did not lemmatise their corpus (= remove suffixes and 
reduce words to their stem forms); nor did they ignore high-frequency words 
like of, there, where. However, they did relate the number of occurrences of 
a grapheme to the number of times each word appeared in the database - 
that is, they calculated text frequencies rather than lexical frequencies - see 
the discussion in section 3.3. Even so, their frequencies are not the mirror- 
image of Carney’s. Producing mirror-image frequencies would require using 
exactly the same database, the same set of conventions (especially whether 
to lemmatise or not), and the same set of graphemes for the analyses in both 
directions. Such an analysis has yet to be undertaken. 

Having established their sets of graphemes and correspondences, Gontijo 
et al. calculated the number of occurrences of each grapheme and its 
frequency within the whole database, and the frequency of every grapheme- 
phoneme correspondence as a subset of all the correspondences for the 
relevant grapheme For example, they calculated that: 

grapheme <a> accounted for 3,746,713 of the total of 67,590,620 
grapheme occurrences 

<a> therefore represented 5.55% of all the grapheme occurrences in 
the database 

<a> pronounced /a/ occurred 591,123 times, and that correspondence 
therefore represented 15.8% of the 3,746,713 correspondences for 
grapheme <a>. 
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To arrive at the percentages presented in chapters 9 and 10, | have modified 

Gontijo et al.’s results in various ways. To give just two examples: 

1) The way they (and Mountford, 1998) analysed word-final <e> resulted in far 
too many split digraphs, trigraphs, etc.; e.g. they treated <a, e> in collapse 
as an example of <a.e>. In my opinion, <a, e> here are better analysed 
as <a> pronounced /z/ and <e> as part of <se> pronounced /s/; 

2) Their system recognised too few graphemes ending in <r>, e.g. <air> 
in dairy is split into <ai> pronounced /ea/ and <r> pronounced /r/, 
whereas my analysis posits that the <r> in such cases is not only a 
grapheme in its own right spelling /r/ but also part of <air> spelling 
/ea/ - see sections 5.6.3, 7.1 and 10.6, and section A.8 in Appendix A. 

Rather than listing all the differences between my calculations and Gontijo 

et al.'s, let me just say that, where | could, | have re-allocated sets of words 

and correspondences in accordance with my analysis, and then re-calculated 
the frequencies of the correspondences within graphemes. 
The outcomes are that: 
| give no percentages for a large number of minor graphemes, those 
which have only one pronunciation and for which it would be otiose to 
keep saying ‘100%’. This applies to 154 of the 195 minor graphemes 
across these two chapters 
for the 41 minor graphemes with more than one pronunciation | give 
percentages only in the few cases where Gontijo et al.’s data provide 
them, otherwise not 
| give separate percentages for the correspondences of as many 
main-system graphemes as possible, including (again, where Gontijo 
et al.’s data provide them) for the minor correspondences of such 
graphemes, e.g. under <ch>; for the main exceptions to this see the 
first paragraph of section 10.1. 


9.4 The general picture: the regular 
pronunciations of English graphemes 
beginning with consonant letters 


This chapter contains 38 main entries for graphemes beginning with 
consonant letters, in alphabetical order, even though Table 8.1 lists 58 
graphemes spelling consonant phonemes in the main system. The reasons 
for the discrepancy are: 
as mentioned above, the 12 geminate spellings have joint entries with 
the single letters, and <dg, dge> have a joint entry 
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all the correspondences for <ed, ew, i, u, ue, u.e, y>, Consonantal, 
vocalic and 2-phoneme, are covered in chapter 10. 
For the 51 main-system graphemes covered in this chapter, the general 
picture can be summed up as follows: 
The 21 graphemes listed in Table 9.1 have only one pronunciation 
each (except for one tiny exception under <b>): 


TABLE 9.1: 21 MAIN-SYSTEM CONSONANT GRAPHEMES WITH ONLY 
ONE PRONUNCIATION EACH. 


These graphemes 
are always pronounced as 
these phonemes 

b, bb /b/ 

ck /k/ 

dd /d/ 

dg, dge /d3/ 

ff /f/ 

k /k/ 

mm /m/ 

nn /n/ 

Pp, pp /p/ 

q /k/ 

rrr /r/ 

sh /S/ 

ssi * /J/ 

tch /¥/ 

tt /t/ 

ve * /v/ 

Ww /w/ 


* For these graphemes, the statement that they have only one pronunciation each 
involves defining the circumstances in which they constitute separate graphemes 
carefully; the rest are pronounced as shown in all positions in the word where they 
occur - this qualification is needed to recognise that several do not occur initially 
and others do not occur finally; all 21 occur medially. 
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The 20 graphemes listed in Table 9.2 have only one frequent 
pronunciation each: 


TABLE 9.2: 20 MAIN-SYSTEM CONSONANT GRAPHEMES WITH ONLY 
ONE FREQUENT PRONUNCIATION EACH. 


These graphemes 
are mostly pronounced as 
these phonemes 

ch // 
Ci* /S/ 
d /d/ 
f (ignoring of) /f/ 
gg /g/ 
h /h/ 
J / cB / 
I, Il /I/ 
le * /al/ 
m /m/ 
ng /y/ 
nn /n/ 
ph /f/ 
SS /s/ 
ti /S/ 
Vv /v/ 
wh /w/ 
Z, ZZ /z/ 


* Forthese graphemes, the statement that they have only one frequent pronunciation 
each involves defining the circumstances in which they constitute separate 
graphemes carefully; the rest are pronounced as shown in all positions in the 
word where they occur - this qualification is needed to recognise that several do 
not occur initially and others do not occur finally; all 20 occur medially. 
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The nine graphemes listed in Table 9.3 have two main pronunciations 
each, and the circumstances in which the two pronunciations occur 
can be defined quite closely: 


TABLE 9.3: NINE MAIN-SYSTEM CONSONANT GRAPHEMES WITH 
TWO REGULAR PRONUNCIATIONS EACH. 


This grapheme has these two main 
pronunciations 

Cc /k, s/ 

ce /s, J/ 

g /9, c3/ 

n /n, 9/ 

se /Z,s/ 

si /3,J/ 

t /t, Y/ 

th /8. 0/ 

x /ks, z/ 


<s> is the only main-system grapheme beginning with a consonant letter 
which is a major problem: it is mainly pronounced /s/ but has lots of 
exceptions (mainly where it is pronounced /z/) for which no rules can be 
stated, especially in medial position. 

This means that 41 of these 51 graphemes have only one, or only one 
frequent, pronunciation, and the other 10 have only two main pronunciations 
each; none have more than two main-system pronunciations. 

For completeness, it should also be noted that many minor consonant 
graphemes also have highly predictable pronunciations, e.g. word-final 
<que>. In fact, of the 107 graphemes beginning with consonant letters that 
are outside the main system, only 12 <cc che cz gh gn mn nd phth sc sch te 
xh> have more than one pronunciation. In any attempt (not made here) to 
estimate the overall regularity of the system this would need to be taken into 
account. However, many minor graphemes are so rare that they would not 
affect the regularity calculation unless they occur in high-frequency words. 

To complete the picture for graphemes beginning with consonant 
letters, Table 9.4 lists all 51 of them and shows their main-system and 
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minor correspondences and numbers of Oddities. Table 9.4 is almost but 


not quite the mirror-image of Table 8.1 because: 


graphemes which begin with consonant letters but vowel phonemes 


(e.g. <ho> in honest) are included here; 


graphemes which begin with vowel letters but consonant phonemes 


(e.g. <ue> pronounced /ju:/) are not included here but in Table 10.1. 


TABLE 9.4: MAIN-SYSTEM GRAPHEMES BEGINNING WITH CONSONANT LETTERS, BY 
MAIN-SYSTEM AND MINOR CORRESPONDENCES AND NUMBERS OF ODDITIES. 


Main system The rest 
Grapheme Basic Other main- Exceptions to main | Number of 
phoneme | system system (minor Oddities * which 
correspondences | correspondences) the grapheme 
‘leads’ 
b /b/ /p/ 6 
bb /b/ 
c /k/ /s/ /S tf/ 12 
ce /s/ /S/ 
ch /tf/ /k f d3/ 3 
ci /S/ /f 3/ 
ck /k/ 1 
d /d/ /o3/ 7 
dd /d/ 1 
dg /d3/ 
dge /d3/ 
f /f/ lv/ é 
ff /f/ 1 
g /g/ /o3/ /k3/ 12 
ge /d3/ /3/ 
99 /g/ /c3/ 
h /h/ /j/ 5 


* including 2- and 3-phoneme pronunciations and doubled spellings which are 
not part of the main system. 
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TABLE 9.4: MAIN-SYSTEM GRAPHEMES BEGINNING WITH CONSONANT LETTERS, BY 
MAIN-SYSTEM AND MINOR CORRESPONDENCES AND NUMBERS OF ODDITIES, CONT. 


Main system The rest 
Grapheme | Basic Other main-system | Exceptions to main | Number of 
phoneme | correspondences system (minor Oddities * which 
correspondences) the grapheme 
‘leads’ 
J / 3 / /j3h/ 1 
k /k/ 4 
/l/ /al/ 1 
le /al/ Al/ 
I /I/ /j\j/ | 
m /m/ /am/ 5 
mm /m/ ] 
n /n/ /q/ /an/ 7 
ng /n/ /n/nk/ 3 
nn /n/ ] 
p /p/ 5 
ph /f/ /pv/ 2 
pp /p/ 2 
q /k/ 2 
r /r/ 3 
rr /r/ 2 
s /s/ /Z3/ /S/ 12 
se /s/ /Z/ 3 
sh /J/ 
si /3/ /S/ /Z/ 
ss /s/ /Sz/ 1 
ssi /Jl 
t /t/ /tf/ /S s/ 5 
tch /tf/ 
th /6/ /8/ /t tf t0/ 2 
ti /S/ /tf 3/ 
tt /t/ 1 
v /v/ /f/ 1 
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ve /v/ 

w /w/ 2 

wh /w/ /h/ 

x /ks/ /z k gz kf g3 eks/ 4 

z [z/ /s3ts/ 2 

Zz [z/ /ts/ 

Total 51 12 49 123 

51 63 172 
Grand total of correspondences: 235 


* including 2- and 3-phoneme pronunciations and doubled spellings which are 
not part of the main system. 


9.5 Order of description 


In most of the 38 main entries in this chapter | list the items in this order: 

1) The basic phoneme. In my opinion, each of these graphemes has a basic 
phoneme, the one that seems most natural as its pronunciation. Where 
the basic phoneme is the only pronunciation of the grapheme it is labelled 
‘Only phoneme’. Where a geminate spelling always or mostly has the same 
pronunciation as the single letter they are shown together. However, 
there are five geminate spellings which are minor graphemes: <cc, jj, kk, 
vw, ww> - these are listed under Oddities below the single letter. <hh> 
occurs too, but only at the morpheme boundary in compound words, e.g. 
witchhunt, and <q, x> appear doubled only in brand names or foreign 
words. These three are therefore mentioned only to exclude them. 

2) Any other phoneme which counts as a main-system pronunciation of 
the grapheme, as defined above. Where there are no such phonemes 
this subheading is omitted. 

These two categories constitute the main system for grapbheme-phoneme 

correspondences for graphemes beginning with consonant letters. 

Correspondences in the main system are shown in 9-point type, the rest in 

smaller 7.5-point type. 

3) Any doubled-letter grapheme which is not part of the main system (this 
sub-heading is also omitted where it is not relevant). 

4) Exceptions to the main system, including any 2- or 3-phoneme 
correspondences for the main grapheme(s). The reason for listing 
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exceptions to the main system separately from the Oddities is that this 
is the clearest way of showing where the main rules break down. 

5) The geminate spelling plus final <e>, if it occurs. Where it might but 
does not, | say so; elsewhere | omit this heading. 

6) Oddities, minor graphemes which begin with the letter(s) of the main 
grapheme and occur only in restricted sets of words. 

7) Any 2- or 3-phoneme graphemes which include, but do not have 
entirely the same spelling as, the main grapheme. Almost all the 2- and 
3-phoneme graphemes are also Oddities, but a few belong to the main 
system and are included there. 

Most entries end with Notes, and two (<s, se>) have Tables. 

The only exceptions to this ordering are 15 of the graphemes which have 
only one pronunciation each: <b, bb, ck, dg, dge, k, p, pp, q, r, rr, sh, ssi, 
tch, ve>. Under each of these there is just one heading, ‘Only phoneme’, 
and it is automatically part of the main system without having to be so 
labelled; however, most of these entries have Notes. The other 6 graphemes 
which have only one pronunciation each (<dd, ff, mm, nn, tt, w>) have/are 
within more extended entries. 

Where a grapheme cannot appear in all of initial, medial and final 
positions there is usually a note to this effect at the head of its entry, with 
this exception: because doubled consonant spellings never occur word- 
inirially (except <II> in Hama, Ilano), the headings where doubled spellings 
appear are not labelled to this effect. 


9.6 <b, bb> 


THE MAIN SYSTEM 


Only phoneme (almost) /b/ 100% e.g. rabid, rabbit 
THE REST 
pronounced 
Exception to main system <b> /p/ only in presbyterian pronounced 


/prespr'trarisjan/ (also 
pronounced /prezbr'tiari:jan/), 
where the <b> devoices to /p/ if 
the <s> is pronounced /s/ 


Word-final doubled letter + <e> (does not occur) 
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Oddities <bd> /d/ only in bdellium /‘deli:jam/ 


<bh> /b/ only in abhor(red) /a'b>:(d)/, 
abhorrent /a'borant/, bhaji, 
bhang(ra), bhindi, Bhutan and a 
few other rare words from the 
Indian sub-continent. <b, h> 
are usually separate graphemes 
at a morpheme boundary, as in 
clubhouse, subheading 


<bp> /p/ only in subpoena /sa'pitna/ 


<bt> /t/ only in debt, doubt, subtle. /b/ 
surfaces in debit, indubitable, 
subtility - see section 7.2 


<bu> /b/ only in build, buoy, buy 
<bv> /v/ only in obvious pronounced 
/'ovisjas / 
2-phoneme graphemes (none) 


NOTE 


For <ba> in syllabary, and for <be> in deliberate, gooseberry /'guzbri:/), 
liberal, raspberry /'ra:zbri:/), strawberry /'stro:bris/), see section 6.10. 


9.7 <c> 


N.B. <ce, ch, ci, ck, tch> have separate entries. 


THE MAIN SYSTEM 
Basic phoneme /k/ 67% e.g. cat. Regular before <a, 
0, u> and consonant letters 
Other phoneme /s/ 30% e.g. city. Regular before <e, 
i, y> 
THE REST 
Exceptions to main system pronounced 3% in total 
<c> /k/ before <e, i, y> only in arced, 


arcing, Celt, Celtic (but the Glasgow 
football team is /'selttk/), sceptic, 
synced, syncing 
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Word-final doubled letter + <e> 


Oddities 


(which means that the spelling 
synch for this verb is better), 

and words beginning encephal- 
pronounced /enkefal-/ (also 
pronounced with /ensefal-/). 

Also, in July 2006 the superlative 
adjective chicest /'fi:k1st/ appeared 
on a magazine cover - the 
comparative would presumably be 


chicer 

<c>  /s/ other than before <e, i, y> only in 
apercu, facade (lacking their French 
cedillas) 

<c> /J/ only in officiate, speciality, specie(s), 


superficiality and sometimes 
ap/de-preciate, associate. See Notes 


<c> = /tf/ only in cellist, cello, cicerone (twice), 
concerto (second <c>) 


(does not occur; in 
recce <cc, e> are 
separate graphemes) 


<cc> /ks/ almost 100% before <e, i, y>, 
where (following the general rules 
for <c> above) the two letters are 
separate graphemes, e.g. accent, 
occiput, coccyx. This entry, with 
2 graphemes corresponding 
separately to 2 phonemes, strictly 
speaking does not belong in this 
book based on correspondences 
to and from single graphemes, 
but it has to be included for 
clarity over the single-phoneme 
correspondences of <cc> in 
the next four paragraphs; 
<cc> pronounced /ks/ is not 
counted in the overall totals of 
correspondences 


<cc> /tf/ before <e, i> only in bocce, 
cappuccino. There are no 
occurrences of <cc> pronounced 
/t{/ before <y> 
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<cc> /s/ 
<cc> /k/ 
<cc> /k/ 
<cch> /k/ 
<cq> /k/ 
<cqu> /k/ 
<ct> /t/ 
<cu> /k/ 
<cez> /tf/ 
<cz> /z/ 
2-phoneme graphemes (none, but see <cc> 


pronounced /ks/ 
under Oddities) 


NOTES 


before <i> only in flaccid, succinct 
pronounced /'flasid, sa'sinkt/ 
(also pronounced (regularly) 
/‘fleeks1d, sak'stnkt/). There are no 
occurrences of <cc> pronounced 
/s/ before <e, y> 


before <e, i, y> only in baccy, 
biccy, recce /'reki:/ (short for 
reconnoitre), soccer, speccy, 
streptococci 


100% before <a, 0, u>, e.g. 
occasion, account, occur 


only in bacchanal, Bacchante, 
bacchic, ecchymosis, gnocchi, 
saccharide, saccharine, zucchini 


only in acquaint, acquiesce, 
acquire, acquisitive, acquit, with 
the <u> being pronounced /w/ 


(not /kw/) only in lacquer, picquet, 
racquet 


only in Connecticut, indict, 
victualler, victuals. /t/ surfaces in 
indiction - see section 7.2 


only in biscuit, circuit 


only in czardas /'t{aidef/, Czech 


/tfek/ 


only in czar(ina) /zax(‘ri:na)/ 


Given the small numbers of words in which the major correspondences do 


not apply, those two correspondences stated context-sensitively mean that 


pronunciations of <c> as a single-letter grapheme are 97% predictable. 


Medial <c> pronounced /{/ is always followed by <i(e)>, but the <i(e)> 


is a separate grapheme pronounced /i:/. Some of the relevant words have 


alternative pronunciations with /s/, e.g. appreciate as /a'pri:fizjert/ or 


/a'pritsisjert/, associate as /a'sausitjert/ or /a'sausisjert/ (taking associate 
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as a verb; the noun of the same spelling ends in /at/), species as /'spi:fi:z/ 
or /'spizsirz/. However, when verbs ending in <-ciate> are nominalised 
with the suffix /an/ spelt <-ion>, which compulsorily changes the final 
/t/ of the verb to medial /J/, in many RP-speakers’ accents a phonological 
constraint seems to operate against medial / J / occurring twice; for example 
appreciation, association are pronounced /apritsi:'jerfan, a'sausi:'jerfan/, 
not /aprisfis'jerfan, a'sausis'jerfan/. 

For <ca> in adverbs ending <-ically>, which is always pronounced 
/tkliz/, apothecary and forecastle pronounced /'fauksal/, and for <co> in 
chocolate, decorative, see section 6.10. 


9.8 <ce> 


Never initial. 


THE MAIN SYSTEM 


For both categories and for estimated percentages see Notes. 


Basic phoneme /s/ except ina few suffixed forms (see section 6.4), 
only word-final, e.g. fence, once, voice. In final 
position there is only one exception 


Other phoneme /Jf/ never initial; word-finally only in liquorice 
pronounced /'Irkar1f/ (also pronounced /'Itkaris/); 
otherwise only medial: regular in the ending 
<-aceous> pronounced /'etfas/, e.g. cretaceous, 
curvaceous, herbaceous, sebaceous and about 100 
other words, mostly scientific and all very rare, plus 
cetacean, crustacea(n), Echinacea, ocean, siliceous 


THE REST 


pronounced 


Exception to main system word-final <ce> /Jf/, not /s/ only in liquorice 
pronounced /'Itkar1f/ 


Oddities (none) 


2-phoneme graphemes (none) 
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NOTES 


Gontijo et al. (2003) do not recognise word-final <ce> as a separate 
grapheme, so give data only for its medial occurrences. However, it is clear 
that in both of the very restricted circumstances where it is a separate 
grapheme <ce> is virtually 100% regular. 

In all unsuffixed words with medial <ce> as a digraph the stress falls 
on the vowel preceding /s/ spelt <ce>, and that vowel is spelt with a single 
letter which has its letter-name pronunciation (only exception: siliceous 
/st'ltfas/). 

In many words, word-final <e> after <c> following a single vowel letter 
is also part of a split digraph with the vowel letter; see the entries for the 
six split digraphs in chapter 10, sections 10.4/17/24/28/38/40. However, 
in some words the vowel letter preceding <ce> is a separate grapheme with 
its ‘short’ pronunciation, e.g. practice; for these exceptions also see the 
sections just cited. 

In all cases other than those defined above, <c, e> are separate 
graphemes; in particular, note oceanic /ausi:'janik/, panacea /pznea'si:ja/. 
Word-final <c, e> are separate graphemes only in fiance, glace (now 
increasingly spelt even in English text with French <é>). 


9.9 <ch> 


N.B. <tch> has a separate entry. 


THE MAIN SYSTEM 
Basic phoneme /tf/ 87% e.g. chew, detach 
THE REST 
pronounced 
Exceptions to main <ch> /k/ 10% regular (no exceptions) before a 
system consonant letter, e.g. aurochs, chlamydia, 


chloride, chlorine, chrism, Christ(ian(ity)), 
Christmas, Christopher, chrome, 
chromosome, chronic and every other 
word beginning <chron->, chrysalis, 
chrysanthemum, drachma, lachrymose, 
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<ch> 


/f/ 


ochre, pinochle, pulchritude, sepulchre, 
strychnine, synchronise, technical, 
technique; also in many words of Greek 
origin, e.g. amphibrach, anarchy, 
anchor, archaic and every other word 
beginning <arch-> where the next 
letter is a vowel letter (exceptions: 
arch-enemy, archer, with /t{/), brachial, 
brachycephalic, bronchi(al/tis), 
catechis-e/m, chalcedony, chameleon, 
chaos, character, charisma, chasm, 
chemical, chemist, chiasma, chimera, 
chiropody (also pronounced with initial 
/§/), choir, cholesterol, cholera, choral, 
chord, choreography, chorus, chyle, 
chyme, cochlea, diptych, distich, echo, 
epoch, eschatology, eucharist, eunuch, 
hierarch(y) and every other polysyllabic 
non-compound word ending <-arch(y)>, 
hypochondriac, ichor, lichen pronounced 
/‘latkan/ (also pronounced /'I1tfan/), 
machination, malachite, mechani-c/sm, 
melanchol-y/ic, orchestra, orchid, 
pachyderm, parochial, pentateuch, 
psyche and all its derivatives, scheme, 
schizo and all its derivatives, scholar, 
school, stochastic, stomach, synecdoche, 
trachea, triptych, trochee. Words of 
non-Greek origin in this group are 
ache, baldachin, chianti, chiaroscuro, 
cromlech, Czech, masochist, Michael, 
mocha, oche, scherzo, schooner; also 
broch, loch, pibroch, Sassenach when 
pronounced with /k/ rather than Scots 
/x/. See Notes 


2% phonemically and orthographically 
word-finally only in (Germanic) 

milch, mulch, Welch; otherwise only 

in about 50 words of mainly French 
origin, namely (initially) chagrin, chaise, 
chalet, chamois, champagne, chancre, 
chandelier, chaperone, charabanc, 
charade, charlatan, Charlotte, 
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chassis, chateau, chauffeu-r/se, 
chauvuinism, chef, chemise, chenille, 
cheroot, chevalier, chevron, Chicago, 
chi-chi (twice), chic(ane(ry)), chiffon, 
chignon, chivalr-ic/ous/y, chute; also 
sometimes in (Greek) chiropody (hence 
the punning shop name Shuropody); 
(medially) attache, brochure, cachet, 
cachou, cliche, crochet, duchesse, 
echelon, embouchure, Eustachian, 
machete, machicolation, machine, 
marchioness, nonchalant, parachute, 
pistachio, recherche (twice), ricochet, 
ruching, sachet, touche; (phonemically 
but not orthographically word-finally) 
fiche, gouache, moustache, niche 
pronounced /ni:f/ (also pronounced 
/nit{/), pastiche, quiche, ruche. 
Contrast word-final <che> pronounced 
/J/ and word-final <ch, e> as separate 
graphemes, below 


<ch> /d3/ 1% only in ostrich, sandwich, spinach 
pronounced /'pstrid3, 'semwid3, 
'spinidz/ 
Oddities <che> /S/ only in barouche and about 13 words 


of French origin, namely (medially) only 
rapprochement, (finally) avalanche, 
blanche, brioche, cache, cartouche, 
cloche, creche, douche, farouche, 
gauche, louche, panache. |n all these 
words the final <e> is irrelevant to the 
pronunciation of the preceding vowel 
grapheme. Contrast the words where 
word-final <e> after <ch> is instead 
part of a split digraph (ache and fiche ... 
ruche two paragraphs above) and word- 
final <ch, e> as separate graphemes, 
below 


<che> /tf/ only in niche pronounced /nitf{/ (also 
pronounced /ni:f/) 


<chs> /S/ only in fuchsia /'fju:fe/ 


2-phoneme graphemes (none) 
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NOTES 


There are a few cases in which word-final <ch, e> constitute two graphemes 
rather than one: attache, cliche, recherche, touche with /fe1/ (sometimes 
spelt even in English text with French <é>), menarche, oche, psyche, 
synecdoche with /ki:/, but there appear to be no cases at all in which <c, h> 
are separate graphemes. 

<ch> is also sometimes pronounced /x/ as in Scots broch, dreich, loch, 
Sassenach and German-style pronunciations of names like Schumacher, but 
| have not included this correspondence in my analysis because /x/ is not 
a phoneme of RP. 


9.10 <ci> 


Only medial. 


THE MAIN SYSTEM 


Basic phoneme /J/ 100% regular when both preceded and 
followed by vowel letters, e.g. 
audacious, magician, specious. 
Extension: commercial, where the 
preceding <er> digraph nevertheless 
spells a (long) vowel phoneme. See 


also Notes 

THE REST 

pronounced 
Exceptions to main <ci> /tf/ only in ancient /‘e1ntfant/, ciabatta /t{a'xta/ 
system 
<ci> /3/ only, exceptionally but increasingly, in 

coercion pronounced /kau'w3:3an/ (usually 
pronounced /kau'w3:fen/) 

Oddities (none) 

2-phoneme (none) 


graphemes 
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NOTES 


In most cases the stress falls on the vowel preceding /J/ spelt <ci>, and that 
vowel is spelt with a single letter which has its letter-name pronunciation. 
Exceptions: if the preceding vowel letter is <i> it is pronounced /1/, e.g. 
magician; also precious, special with /e/. 

In all other cases, <c, i> are separate graphemes. 


9.11 <ck> 


Never initial. 
THE MAIN SYSTEM 
Only phoneme /k/ 100% e.g. black 
THE REST 
pronounced 
Exceptions to main system (none) 
Oddity <ckgu> /g/ only in blackguard 
/‘'bleged, 'blega:d/ 
2-phoneme graphemes (none) 
NOTE 


The only word in which <c, k> belong to separate morphemes and therefore 
graphemes seems to be acknowledge, and even there the phoneme is /k/. 
This counts as a curious ‘surfacing’ sound - see section 7.2. 


9.12 <d, dd> 


N.B. <dg, dge> have a separate entry. <ed>, as in past tense and participle 
verb forms, has a separate entry in chapter 10, section 10.15. 


THE MAIN SYSTEM 


Basic phoneme /d/ <d> 99%, e.g. bud, buddy 
<dd> 100% 
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THE REST 


Exceptions to main <d> 


system 


Word-final doubled (does 


letter + <e> 


Oddities 


not 
occur) 


<ddh> 


<de> 


<dh> 


<di> 


<dj> 


<dne> 


pronounced 


/d3/ 


/d/ 
/d/ 


/d/ 


/d3/ 


/d3/ 


/n/ 


1% of correspondences for <d>. Never word- 
final; regular initially and medially before <u> 
followed by another vowel letter or <r>, e.g. 
arduous, assiduous, (in)credulous, deciduous, 
dual/duel (cf. the homophone jewel), due (cf. 
the homophones dew, Jew), duet, duke, dune, 
dupe, duty, education, graduate pronounced 
either /'greedgu:wat/ (noun) or /'gredgu:wert/ 
(verb), durable, duration, duress, during, 
endure, fraudulen-ce/t, glandular, modul-e/ar, 
nodul-e/ar, pendulum, sedulous, procedure, 
verdure (cf. the homophone vergen; also 

in gradual, individual, residual whether 
pronounced with /dgu:wal/ or /dgal/ (for the 
elision of the <u> see section 6.10). Also in 

a few words before <eu, ew>: deuce (cf. the 
homophone juice), various words beginning 
with (Greek) deuter-, dew (cf. the homophones 
due, Jew), grandeur. See Notes 


only in Buddha and derivatives, saddhu 


only in aide, blende, blonde, horde and in 
bade, forbade (past tenses of bid, forbid) 
pronounced /bed, fa'bed/ (also pronounced 
/berd, fa'berd/) 


only in a few loanwords from the Indian 
subcontinent, e.g. dhobi, dhoti, dhow, Gandhi, 
jodhpurs, sandhi, Sindh 


only in cordial pronounced/'k>:dgal/ (also 
pronounced /'ko:di:jal/), incendiary, 
intermediary, stipendiary, subsidiary 
pronounced with /dgari:/, soldier 


only in about 10 words containing the (Latin) 
prefix <ad->: adjacent, adjective, adjoin, 
adjourn, adjudge, adjudicate, adjunct, adjure, 
adjust, adjutant, plus djinn 


only in Wednesday 
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<dt> /t/ only in veldt 
2-phoneme (none) 
graphemes 
NOTES 


For <da> in dromedary, lapidary, laudanum, legendary, secondary, <de> 
in broadening, considerable, gardener, launderette, widening and <di> in 
medicine see section 6.10. 

All the words in which <d> is pronounced /dg/ were formerly pronounced 
with the sequence /dj/, and conservative RP-speakers may still pronounce 
them that way (or imagine they do). Pronunciations with /dj/ would require 
an analysis with the <d> pronounced /d/ and and the /j/-glide as part of 
the pronunciation of the <u> and following <r> or vowel letter. See <t>, 
section 9.33, for the largely parallel correspondence to voiceless /t{/, and 
<di> in the Oddities. 


9.13 <dg, dge> 


Only phoneme /dB/ 100% e.g. badger, bridge, bridging, curmudgeon 


NOTE 


There seem to be no cases where <d, g(e)> are separate graphemes except 
at morpheme boundaries, e.g. headgear. 


N.B. <ed> Though this grapheme has mainly consonant pronunciations, 
because it begins with a vowel letter it is covered in chapter 10, section 10.15. 


9.14 <f, ff> 


For percentages see Notes. 


THE MAIN SYSTEM 

Basic phoneme /f/ <f> e.g. full. 100% provided of is treated as a special case 
<ff> 100%. e.g. cliff, staff 

Other phoneme /v/ only in of and roofs pronounced /ru:vz/ 


for <f> 
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THE REST 
pronounced 

Exceptions to main (none) 

system 

Word-final doubled <ffe> /f/ only in gaffe, giraffe, pouffe; also 

letter + <e> in usual pronunciation of different, 
difference, sufferance (but not afferent, 
efferent) - see also section 6.10 

Oddities <fe> /f/ only in carafe and some instances of 
elided vowels - see Notes 

<ft> /f/ only in often, soften 

2-phoneme (none) 

graphemes 

NOTES 


Gontijo et al. (2003) found that 88% of all occurrences of <f> in their 
database were <f> pronounced /v/ in of, and only 12% were <f> pronounced 
/f/ in other words, but this is thoroughly misleading. Provided <f> in of is 
recognised as a special case (and roofs pronounced /ru:vz/ is rare), all other 
graphemes beginning <f> are pronounced /f/, = 100% predictable. 

For <(f)fe> in cafetiere, conference, deference, difference, different, 
offering, preferable, preference, sufferance, <fi> in definitely, <for> in 
comfortable, <fu> in beautifully, dutifully see section 6.10. 


9.15 <g, gg> 


N.B. <dg(e), ge, ng> have separate entries. The entry for <ng> also covers 
all the cases where <n> before <g> is a separate grapheme. 


THE MAIN SYSTEM 


Basic phoneme /g/ <g> 71%, e.g. game, braggart, egg. Regular 
<gg> 70% except for <g> before <e, i, y>, 
but see the exceptions. Also see 
Notes 


Other phoneme /d3/ 28% of corres- Regular before <e, i, y>. See Notes 
for <g> pondences for 
<g> 


THE REST 


Exceptions to main 
system 


<g> 


<g> 


<g> 


<g> 
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pronounced 


/g/ 


/d3/ 


/k/ 


/3/ 


exceptions for <g> are 1% of its 
correspondences in total 


before <e, i, y> in auger, beget, bogie, 
bogey, conger, eager, finger, fogey, gear, 
gecko, geese, gel (/gel/, conservative 
pronunciation of girl; contrast gel ‘viscous 
liquid’ pronounced /dgel/), geld, get, geyser, 
hegemon-y/ic, laager, lager, monger 

and all its compounds, renege (for this 
word see also <e.e>, section 10.17, and 
Notes to next section), target (contrast 
parget, with regular /d3/), tiger, together, 
anthropophagi, begin, giddy, gill (‘lung 

of fish’; contrast gill ‘quarter of a pint’ 
pronounced /dzi1I/ and see Notes), gillie 
(also spelt ghillie), gilt, gimbals) (also 
pronounced with /g/), gimlet, gimp, gird, 
girdle, girl, girn, girt, girth, give, gizzard, 
yogi and first <g> in gig, giggle, gingham, 
gynaecology 


not before <e, i, y> only in gaol, margarine 
(also pronounced with /g/), Reg, veg, and 
second <g> in mortgagor 


only in length, lengthen, strength, 
strengthen pronounced /lenk®, 'lenk@an, 
strenk®, 'strenk@an/ (also pronounced 
/len®, 'len@an, stren®, 'stren6an/) - for the 
rationale of this analysis see Notes under 
/n/, section 3.8.2 - and in angst /enkst/, 
disguise /dt1s'ka1z/, disgust pronounced 
/dts'kast/, i.e. identically to discussed; 
disguise, disgust are also pronounced 
/diz'gaiz, diz'gast/, i.e. with <s, g(u)> both 
voiced rather than voiceless 


initially, only in genre, gilet; medially, only 
in aubergine, conge, dirigiste, largesse, 
negligee, protege, regime, tagine and 
lingerie pronounced /'‘lengari:/ (also 
pronounced /'‘londgaretr/) 
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Word-final doubled 
letter + <e> 


Oddities 


<gg> 


(does not occur) 


<gh> 


<gh> 


<gh> 
<gh> 


<gi> 


<gl> 


<gm> 


<gn> 


/d3/ 


/f/ 


/g/ 


/k/ 
/p/ 
/d3/ 


/M/ 


/m/ 


/n/ 


30% of correspondences for <gg>, but 
occurs only in arpeggio, exaggerate, loggia, 
Reggie, suggest, veggie, vegging. See Notes 


75% of pronunciations for <gh>, but see 
Notes/. Medially, only in draught, laughter, 
otherwise only word-final and only in 
chough, cough, enough, laugh, rough, 
slough (‘shed skin’), sough, tough, trough 


25% of pronunciations for <gh>, but see 
Notes. Word-final only in ugh; otherwise 
only in afghan, aghast, burgher, ghastly, 
ghat, ghee, gherkin, ghetto, ghillie (also 
spelt gillie), ghost, ghoul, ogham, sorghum 
and a few more rare words 


only in hough /hok/ 
only in misspelling of hiccup as *hiccough 


only in allegiance, collegial, contagio-n/us, 
egregious, legion, litigious, plagiaris-e/m, 
prestigious, region, religio-n/us, vestigial 


only in a few Italian loan words, namely 
imbroglio, intaglio, seraglio, tagliatelle 


only in apophthegm, diaphragm, 
epiphragm, paradigm, phlegm, syntagm. 
/g/ surfaces in paradigmatic, phlegmatic, 
syntagma(tic) - see section 7.2 


only in (initially) gnarl, gnash, gnat, gnaw, 
gneiss, gnome, gnosis, Gnostic, gnu (only 
exception: gnocchi, with /nj/, though gnu 
could also be analysed that way, with <gn> 
pronounced /nj/ and <u> pronounced 

/ux/ rather than /ju:/ - take your pick); 
(medially) cognisance (also pronounced 

with /gn/), physiognomy, recognise 
pronounced /'rekanaiz/ (usually pronounced 
/'rekagnatiz/); (finally) align, arraign, assign, 
benign, campaign, coign, condign, consign, 
deign, design, ensign, feign, foreign, impugn 
and a few other very rare words in -pugn, 
malign, reign, resign, sign, sovereign, thegn, 


2-phoneme 
grapheme 


<gne> 


<gu> 


<gue> 


<gn> 


/n/ 


/g/ 


/g/ 


/nj/ 
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also phonemically word-final in 
champagne, cologne where the final 

<e> is part of a split digraph with the 
letter before the <g>. /g/ surfaces in 
agnostic, diagnosis, prognosis, malignant, 
pugnacious, repugnant, assignation, 
designation, resignation, signal, signature 
- see section 7.2. For exceptions to <gn> 
pronounced /n/ see the 2-phoneme 
grapheme below 


only word-final and only in cockaigne, 
epergne, frankalmoigne /ka'ketn, 1'p3in, 
'frenkelmoin/. In soigne /swa:'njer/ <gn, 
e> are separate graphemes 


only in (initially) guarantee, guard, guerrilla, 
guess, guest, guide, guild, guilder, guile, 
guillemot, guillotine, guilt, guinea, guise, 
guitar, guy and a few more rare words; 
(medially) baguette, dengue, disguise 
pronounced /diz'gatz/ (also pronounced 
/dt1s'ka1z/), languor (the <u> surfaces as 
/w/ in languid, languish - see section 7.2) 
and suffixed forms of a few words in next 
category, e.g. cataloguing, demagoguery, 
(phonemically word-finally) plague, vague; 
fatigue, intrigue, brogue, drogue, rogue, 
vogue; fugue and a few more rare words; in 
this group the vowel letter before <g> and 
the final <e> form a split digraph - contrast 
ague /‘etgju:/ and dengue /'denge1/, and see 
<ngu, ngue> under <ng>. Also see Notes 


only word-final and only in analogue, 
catalogue, colleague, decalogue, 
demagogue, dialogue, eclogue, epilogue, 
ideologue, league, monologue, morgue, 
pedagogue, prologue, prorogue, synagogue, 
where the final <e> is irrelevant both to 
the ‘short’ pronunciation of <o> and to the 
‘long’ pronunciations of <ea, or> preceding 
<gu>. In US spelling several of these words 
are spelt without the final <ue> 


only in chignon, cognac, gnocchi, lasagne, 
lorgnette, mignonette, monsignor, poignant, 
seigneur, soigne, vignette and possibly gnu 
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NOTES 


Given the small numbers of words in which the major correspondences for 
<g> do not apply, those two correspondences stated context-sensitively 
mean that pronunciations of <g> are 99% predictable. There are, however, 
a few homograph pairs with <g> pronounced /g/ in one and /d3/ in the 
other: gel /gel/ (posh pronunciation of girl v. /dgel/’hair lotion’; gill /‘gtl/ 
‘lung of fish’ v. /dgr1l/ ‘quarter of pint’, Gillingham /‘gtltinam/ in Dorset and 
Norfolk v. /‘dgilznam/ in Kent. 

For words containg <n, g> before <e, i> in which the pronunciation of 
the <g> as /g/ is irregular see section 9.24. 

Despite /dj/ being 30% of correspondences for <gg> | have not 
recognised it as a major correspondence because it occurs in so few words, 
and its high frequency seems to be almost entirely due to the two common 
words exaggerate, suggest - and suggest, pronounced /sa'dgest/ in RP, has 
a different pronunciation in General American: /sag'dgest/; here the <g>’s 
are separate graphemes representing separate phonemes - but this is no 
more ‘regular’ than the RP pronunciation because it is the only case where 
two consecutive <g>’s do not form a digraph - indeed, the only case of 
geminate consonant letters which would otherwise constitute a digraph not 
doing so. 

The contexts in which <gh> is pronounced /g/ are easily defined - but 
so is the list of about a dozen words where this correspondence occurs. 
<gh> is also sometimes pronounced /x/ as in Irish lough and names like 
McCullough, Naughtie, but | have not included this correspondence in my 
analysis because /x/ is not a phoneme of RP. 

<gh> is never a separate grapheme after <ai, ei> - see <aigh, eigh> 
under <ai, e>, sections 10.5, 10.12. However, no rule can be defined 
to distinguish the 10 or 11 words where <gh> is a separate grapheme 
pronounced /f/ after <au, ou> from those where <augh, ough> are four- 
letter graphemes, so these just have to be learnt. See also <augh> under 
<au>, section 10.9, and Notes to section 10.33 on <ough>. 

<gu> mostly has 2-phoneme pronuncations, e.g. /gw/ in anguish, 
distinguish, extinguish, guacamole, guano, guava, iguana, language, 
languish, linguist, penguin, sanguine, segue, unguent; /ga/ in gulf, gust, etc. 

For <ga> in vinegary, <go> in allegory, category, <gu> in figurative, 
see section 6.10. 
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9.16 <ge> 


N.B. <dge> has a joint entry with <dg>. 


THE MAIN SYSTEM 


For both categories and for absence of percentages see Notes. 


Basic phoneme /d3/ word-initially, only in geograph-er/y, geomet-er/ry, 
Geordie, George, Georgia(n), georgic, rare medially, 
but cf. burgeon, dungeon, gorgeous, hydrangea, 
pageant, sergeant, sturgeon, surgeon, vengeance 
where the following vowel letter or digraph 
is pronounced /3/, plus pigeon with /1/; also 
dangerous, vegetable - see section 6.10; also 
singeing, swingeing (as distinct from singing, 
swinging), whingeing; word-finally, regular in 
hundreds of words ending <-age> pronounced 
/1d3/, €.g. garage pronounced /'gzrid3/, 
haemorrhage, image, language, mortgage, village 
(for other words in <-age> see previous section); 
also in, e.g., allege, blancmange, change, college, 
flange, hinge, lounge, orange, sacrilege, scavenge 


Rare phoneme /3/ never initial; medially, only in bourgeoisie), 
mangetout, word-finally, only in about 25 words 
of mainly French origin, namely beige, cortege, 
concierge, liege, melange, rouge and, with the 
<e> also forming part of the split digraphs <a.e, 
i.e, u.e> (for dual-functioning see section 7.1), in 
badinage, barrage, camouflage, collage, corsage, 
decalage, décolletage, dressage, entourage, 
espionage, fuselage, garage pronounced /'gexra:3/, 
massage, mirage, montage, triage, sabotage; 
prestige; luge 


THE REST 


(None). 
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NOTES 


Gontijo et al. (2003) do not recognise <ge> as a grapheme, so give no data 
for it. However, given that very few words have <ge> pronounced /3/, the 
percentage for /d3/ would be high. 

In many words, final <e> after <g> following <a> is part of a split 
digraph with the <a> - see section 10.4. There are also a very few examples 
ending <ege, ige, oge, uge> (sections 10.17/24/28/38) and none ending 
<yge> (section 10.40). On split digraphs see also section A.6, and for dual- 
functioning see section 7.1. 

Except in the roughly 24 words listed under the basic phoneme, initial 
and medial <g, e> are always separate graphemes. Word-finally, the only 
such cases appear to be conge, protege with /3e1/ (sometimes spelt even 
within English text with French <é>), sylloge with /dgi:/. In renege /r1'ne1g/ 
| analyse <e.e> as a split digraph pronounced /e1/ - see sections 10.17 and 
A.6 - and the <g> as a single-letter grapheme pronounced (uniquely in this 
position, and irregularly before <e>) /g/ (contrast allege, college, sacrilege 
with /d3/, cortege with /3/). 


N.B. For <gg> see under <g>. 


9.17 <h> 


Never occurs as a single-letter grapheme in word-final position. 
N.B. <ch, ph, sh, tch, th, wh> have separate entries. 


THE MAIN SYSTEM 


Basic phoneme /h/ 100% e.g. cohort, have 
THE REST 
pronounced 
Doubled letter (<hh> occurs only in compound words, 


e.g. bathhouse, where the two letters 
belong to separate morphemes and 


graphemes) 
Exceptions to main <1% 
system 
<h> /j/ only in a very few words between 2 


vowels, namely annihilate, vehement, 


The grapheme-phoneme correspondences, 1 297 


vehicle, vehicular /a'natjtlett, 'virjamant, 
‘virjikal, vis'jikjala/ 
Oddities <hea> /1/ only in forehead pronounced /'for1d/ 
<heir> /ea/ only in heir and derivatives (but there 
is /r/-linking in heiress, inherit - see 


section 3.6; and in inherit /h/ also 
surfaces; see section 7.2) 


<ho> /o/ only in bonhomie, honest, honour and 
derivatives 
<hu> /w/ only in chihuahua (twice) 
2-phoneme grapheme <hour>  /auwa/ only in hour 


N.B. For <i> pronounced as the consonant phoneme /j/ see, nevertheless, 
the entry for <i> in chapter 10, section 10.22. 


9.18 <j> 


THE MAIN SYSTEM 
Basic phoneme /d3/ 100% e.g. jet, majesty 
THE REST 
pronounced 
Doubled letter <jj> /c3/ only in hajj 
Exceptions to main <1% in total 
system 
<j> /j/ only in hallelujah /helt'lu:ja/, and majolica 
pronounced /mar'joltka/ (also pronounced 
/ma'dgvltka/) 
<j> /3/ only in jihad, raj and some rare French 
loanwords, e.g. bijou, goujon, jabot, 
jalousie, jupe 
<j> /h/ only in fajita, jojoba (twice), marijuana, 
mojito, Navajo’ 
Oddities (none) 
2-phoneme (none) 


graphemes 
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9.19 <k> 


N.B. <ck> has a separate entry. 


THE MAIN SYSTEM 


Only phoneme 


THE REST 


Doubled letter 


Exceptions to main 
system 


Word-final doubled 
letter + <e> 


Oddities 


/k/ 


<kk> 
(none) 
(does not 
occur) 


<ke> 


<kh> 


<kn> 


2-phoneme graphemes (none) 


NOTE 


100% 


pronounced 


/k/ 


/k/ 
/k/ 


/n/ 


e.g. kelp, kit, sky 


only in chukker, dekko, pukka and 
inflected forms of trek, e.g. trekkie 


only in Berkeley, burke 


only in astrakhan, gurkha, gymkhana, 
khaki, khan, khazi, khedive, sheikh, 
Sikh. See Note 


only in knack(er(s)), knap, knave, 
knead, knee, knell, knew, knick(ers)), 
knickerbocker, knick-knack, knife, 
knight, knit, knob, knobbly, knock, 
knoll, knot, knowledge), knuckle 

and a few more very rare words. 
Contrast Knesset, with /kn/, and for 
acknowledge see section 7.2 


<kh> also occurs in transcriptions of some Russian names, e.g. Khrushchev, 


Mikhail, where it is meant to represent the /x/ phoneme, like <ch> in 


Scots loch - but since (a) most English-speakers instead pronounce these 
names with /k/ (as in the words listed above under Oddities), and (b) the 
correspondence with /x/ occurs only in names, | have not included this 


correspondence in my analyses. 
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9.20 <1, ll> 


N.B. <le> has a separate entry. 


THE MAIN SYSTEM 

Basic phoneme _ /I/ 100% e.g. lift, fill 

THE REST 

pronounced 
Exceptions to main <\> as 2-phoneme only in axolotl, dirndl, shtetl 
system sequence /al/ /‘eksalotal, 'd3:ndal, ‘ftetal / 
<I> /j/ only in French-/Spanish-like 
pronunciations of bouillabaisse, 
marseillaise, tortilla /buxja:'bes, 
maiser'jez, to:'titjar/ 
<I> as 2-phoneme only in carillon /ka'r1ljan/ 
sequence /lj/ 

Word-final doubled <\le> /\/ medially, only in decollet-age/ee; 

letter + <e> otherwise only final and only in the 
ending -ville, e.g. vaudeville, plus 
bagatelle, belle, braille, chanterelle, 
espadrille, fontanelle, gazelle, grille, 
pastille, nacelle, quadrille (but not 
reveille, tagliatelle where the <e> is 
pronounced /i:/). In chenille, tulle 
| analyse <II> as pronounced /I/ 
and <i.e, u.e> as split digraphs 
pronounced /i:, u:/ - see sections 
5.7.2, 5.7.6, A.6 - and medially in 
guillemot <l|le> is pronounced /li:/ 

Oddity <Ih> /\/ only in philharmonic, silhouette 

2-phoneme (see above) 

graphemes 

NOTE 


For <lle> in chancellery, jewellery see section 6.10. 
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9.21 <le> 


Only final. 


THE MAIN SYSTEM 


Basic /al/ 100% only word-final after a 
pronunciation consonant letter, e.g. 
table, visible 


THE REST 
pronounced 
Exceptions to main <le> /\/ medially, only in Charles; 
system otherwise only word-final and 
only in aisle, cagoule, clientele, 
gargoyle, gunwale, joule, isle, 
lisle, voile. See Notes 
Oddities (none) 
2-phoneme (The basic 
graphemes pronunciation 
is a 2-phoneme 
sequence) 
NOTES 


In many words where final <le> follows a vowel letter and the main rule above 
therefore does not apply, word-final <e> after <I> following a single vowel 
letter is part of a split digraph with the vowel letter; see the entries for the six 
split digraphs in chapter 10, sections 10.4/17/24/28/38/40. 

Initial and medial <I, e> are always two separate graphemes. Word-finally, 
the only such cases (i.e. the <e> is neither part of a split digraph nor part of 
a digraph with <I>) appear to be souffle (sometimes spelt even within English 
text with French <é>) with /let/, facsimile, hyperbole, ukulele with /li:/, 
biennale, finale, guacamole, tamale with either. 

The reason for picking out aisle, cagoule, clientele, gargoyle, joule, isle, lisle, 
voile as having word-final <le> is that the preceding vowel grapheme would be 
pronounced the same if the <e> were not present. Some of the spellings would 
then look even odder, but cagoule does have the alternative spelling kagoul. 


N.B. For <Il> see under <I>. 
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9.22 <m, mm> 


THE MAIN SYSTEM 


Basic phoneme /m/ 100% e.g. mum, sum, mummy, 
summit 


THE REST 


pronounced 


Exceptions to main <1% in total 
system 


<m> as 2-phoneme | only word-finally, but regular in 
sequence /am/ all the words ending in <-sm>, 

e.g. chasm, enthusiasm, orgasm, 
phantasm, pleonasm, sarcasm, spasm, 
several words ending in -plasm (e.g. 
ectoplasm), chrism, prism, schism and 
all the many other words ending in 
-ism, macrocosm, microcosm, abysm, 
aneurysm (also spelt aneurism), 
cataclysm, paroxysm, plus algorithm, 
rhythm and a few other very rare 
words; also film pronounced /'ftlam/ 
in some Irish accents 


Word-final doubled <mme> = /m/ now only in oriflamme and (non- 

letter + <e> computer) programme since gram 
and its derivatives are no longer 
spelt *gramme, etc.; in consomme 
(sometimes spelt even within English 
text with French <é>), <mm, e> are 
separate graphemes 


Oddities <mb> /m/ only word-final and only in 
dithyramb, lamb; climb, limb; aplomb, 
bomb, catacomb, comb, coomb, 
coxcomb, coulomb, hecatomb, rhomb, 
tomb, womb; crumb, dumb, numb, 
plumb, rhumb, succumb, thumb 
and a few more very rare words. /b/ 
surfaces in dithyrambic, bombard ier), 
bombastic, rhomb-ic/us, crumble and 
supposedly, according to some 
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authorities, in thimble (from thumb) 
- see section 7.2. The word-form 
number has the two pronunciations 
/'namba/ (‘amount, numeral’) 

and /'nama/ (‘having less feeling’, 
comparative form of the adjective 
numb) 


<mbe> = /m/ only word-final and only in buncombe 
(‘nonsense’; also spelt bunkum), 
co(o)mbe (‘short valley’; also spelt 
coomb); contrast flambe /'flombet1/ 
(sometimes spelt even within English 
text with French <é>), where <m, b, 
e> are all separate graphemes 


<me> /m/ never initial; mainly word-final and 
there only in become, come, some, 
welcome and the adjectival suffix 
/sam/ spelt <-some>, e.g. handsome 
(contrast hansom); medially only 
in camera, emerald, omelette, 
ramekin pronounced /'remkin/ (also 
pronounced /'remrkin/) - see section 
6.10 - and Thames 


<mn> /m/ 100% of pronunciations of <mn> 

but see Notes. Only word-final and 
only in autumn, column, condemn, 
contemn, damn, hymn, limn, solemn. 
/n/ surfaces in autumnal, columnar, 
columnist, condemnation, contemner, 
damnable, damnation, hymnal, 
hymnody, solemnity - see section 7.2 


<mn> /n/ <1% of pronunciations of <mn> 
but see Notes. Only in mnemonic, 
mnemonist. /m/ surfaces in amnesia, 
amnesty - see section 7.2 


2-phoneme grapheme (see above) 


NOTES 


Given the very different word positions of <mn> pronounced /m, n/ this 
grapheme is 100% predictable. Given that it never occurs medially it is also 
very easy to distinguish from instances of <m, n> as separate graphemes. 
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For <ma> in customary, <me> in camera, emerald, omelette, <mi> in 
admirable, family see section 6.10. 


9.23 <n, nn> 


N.B. <ng> has a separate entry, which also covers all the cases where <n> 
before <g> is a separate grapheme, including those mentioned here where 
the <n> is pronounced /n/. 


THE MAIN SYSTEM 
Basic phoneme /n/  <n> 85%, e.g. tin, tinny. For <n>, /n/ 
<nn> 100% _ is regular except before <c> 
pronounced /k/ and before <ch, g, 
k, q, X>. See Notes 
Other phoneme /n/ 15% regular before <c> pronounced 
for <n> /k/ and before <ch, g, k, q, x>, 
e.g. concur pronounced /kan'k3:/, 
uncle, zinc; anchor, synchronise; 
angle, England, fungus, language, 
langur, length pronounced /lenk@/, 
longevity, prolongation, single; ankle, 
sink, thanks, banquet, conquer, 
anxiety, anxious, larynx, lynx. 
See Notes 
THE REST 
pronounced 
Exceptions to main <1% 
system 


<n> as 2-phoneme only in Haydn (| mention him in memory 
sequence /an/ of Chris Upward of the Simplified Spelling 

Society) and most contractions of not with 
auxiliary verbs, i.e. isn’t, wasn’t, haven't, 
hasn’t, hadn’t, doesn’t, didn’t, couldn't, 
shouldn’t, wouldn’t, mayn’t, mightn’t, 
mustn’t, oughtn’t, usedn’t, some of which 
are rare to the point of disuse, plus durstn’t, 
which is dialectal/comic; in all of these 
except mayn’t the preceding phoneme 
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Word-final doubled <nne> /n/ 


letter + <e> 


Oddities 


<nc> 
<nd> 


<nd> 


<nd> 


<ne> 


/1/ 
/m/ 
/n/ 


/1/ 


/n/ 


is a consonant. Other contractions of not 
with auxiliary verbs (ain’t, aren’t, can’t, 
daren’t, don’t, shan’t, weren’t, won’d), i.e. 
all those with a preceding vowel phoneme 
(except mayn’t) are monosyllabic (though 
some Scots say /'dearant/ with a preceding 
consonant and therefore two syllables, 

and also /r/-linking - see section 3.6). 
Curiously, innit, being a contraction of 
isn’t it, reduces isn’t to a single syllable 


only word-final and only in cayenne, 
comedienne, cretonne, doyenne, tonne 
and a few other rare words 


only in charabanc /‘feraben/ 
only in sandwich /'semwid3/ 


only in grandfather, Grandma (hence the 
frequent misspelling *Granma - cf. section 
4.4.7 on Gran(d)dad), handsome 

(cf. hansom (cab)), landscape 


only in handcuffs, handkerchief /‘henkafs, 
‘henkatfif/ 


non-finally, only in vineyard (and even 
there it is stem-final within a compound 
word) and with an elided vowel (see 

section 6.10) in confectionery, generative, 
stationery, vulnerable; otherwise only 
word-final after a vowel letter and only in 
about 35 words, namely bowline, Catherine, 
clandestine pronounced /klen'destin/ 
(also pronounced /'klandastatn/), cocaine, 
compline, crinoline, demesne, (pre)destine, 
determine, discipline, done, engine, ermine, 
examine, famine, feminine, genuine, gone, 
groyne, heroine, hurricane pronounced 
/‘hartken/ (also pronounced /'hartketn/), 
illumine, intestine, jasmine, marline, 
masculine, medicine, migraine, moraine, 
peregrine, ptomaine, saccharine, sanguine, 
scone pronounced /skon/ (also pronounced 
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/skaun/), shone, urine, vaseline, wolverine. 
In all but one of these words the <e> is 
phonographically redundant, in that its 
removal would not affect the pronunciation 
- the preceding vowel letter (if single) does 
not have its ‘letter-name’ pronunciation, 
and where there are two vowel letters they 
either form a digraph (cocaine, groyne, 
migraine, moraine, ptomaine) or are 
pronounced separately (genuine). The 
exception is done, which needs to be kept 
visually distinct from don, as heroine and 
marline (‘rope’) are from heroin and marlin 
(‘fish’). The only words in which final <n, e> 
are separate graphemes are are aborigine, 
acne, anemone 


<nt> /n/ only in denouement, divertissement, 
rapprochement 
<nw> /n/ only in gunwale 
2-phoneme (see above) 
grapheme 
NOTES 


Given the small numbers of words in which the major correspondences for 
<n> do not apply, those two correspondences stated context-sensitively 
mean that pronunciations of <n> are virtually 100% predictable. Actually, 
they occur even without being consciously noticed because of the 
phonological context. 

Some words beginning encephal-, e.g. encephalitis, are pronounced 
either /ens-/, with the predominant pronunciation of <n> as /n/, or /enk-/, 
with the regular pronunciation of <n> as /n/ before <c> pronounced /k/. 

For <na> in concessionary, coronary, culinary, discretionary, 
extraordinary /1k'stro:danri:/, imaginary, legionary, mercenary, missionary, 
ordinary, precautionary, preliminary, probationary, pulmonary, reactionary, 
revolutionary, stationary, urinary, veterinary /'vetrinri:/, visionary, <ne> 
in confectionery, general, generative, millinery, stationery, <nou> in 
honourable see section 6.10. 
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9.24 <ng> 


Never initial. 


THE MAIN SYSTEM 


Basic phoneme /n/ 100% 


THE REST 


pronounced 


Exceptions to main system 


Oddities 


2-phoneme 
graphemes 


<ng> /n/ or /gk/ 


<ngh> /n/ 
<ngu> /n/ 
<ngue> /n/ 


e.g. bang, sing, long, young, bung. 
Regular word-finally, with no 
exceptions (in RP). /g/ surfaces in 
long-er/est, strong-er/est, young-er/est, 
diphthongise, elongate, prolongation, 
and /d3/ in longevity - see section 
7.2. Medially in stem words, only 

in clangour, hangar, but there are 
thousands of occurrences in suffixed 
forms, e.g. clangorous, clingy, hanger, 
ringer, singer, singing, stinger, 
swinging, wringer. See Notes 


<1% 


only in length, lengthen, strength, strengthen. 
See under /n, k, n/, sections 3.4.5, 3.6.1, 
3.7.2 


only in dinghy, gingham, Singhalese /‘dini:, 
‘ginam, sina'lizz/ (contrast <ng, h> as 
separate graphemes in shanghai /fzn'hat/) 


only in a very few suffixed forms of words 
in next category, e.g. haranguing, tonguing. 
See also end of section 6.4 


only in harangue, meringue, tongue 
/ha'ren, ma'ren, tan/ (contrast <n, gu, e> as 
separate graphemes in dengue /'denget/) 


See <ng> possibly pronounced /nk/, four rows above, and Notes 
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NOTES 


Medially in stem and compound words, the letters <n, g> are always 
separate graphemes representing separate phonemes except in the words 
listed under exceptions to the main system and Oddities above. 

Before <e, i, y> the regular pronunciation of <n, g> is /ndz/ (e.g. Abinger, 
angel, congeal, danger, dungeon, engender, ginger, harbinger, messenger, 
tangent, engine, ingenious, laryngitis; dingy, stingy), i.e. <n, g> follow their 
main rules. Exceptions: 

1) <n, g> pronounced /ng/ before <e, i> (there appear to be no such 
cases before <y>): anger, conger, finger, hunger, linger, long-er/est, 
malinger, mangel, monger, strong-er/est, young-er/est; diphthongise, 
fungi - here the <n> has its regular pronunciation before <g> - see 
previous section, but the pronunciation of the <g> as /g/ is the irregular 
one before <e, i> 

2) <n, g> pronounced /nz/ before <e> (there appear to be no such cases 
before <i, y>): only in ingenue, lingerie pronounced /‘lengzari:/ (also 
pronounced /'lpndgaretr/) 

3) <n, g> pronounced /nd3/ before <e> (there appear to be no such cases 
before <i, y>): only in longevity 

4) <ng> pronounced /n/ before <e, i, y>): none in stem words, but as 
noted above there are hundreds of suffixed examples. 

Before <a, 0, u> and consonant letters the regular pronunciation is /ng/ 

(e.g. angle, elongate, England, fungus, language, langur, prolongation, 

single), i.e. the <n> has its regular pronunciation before <g> - see previous 

section, and the pronunciation of the <g> is also regular. Exceptions: 

1) <ng> pronounced /n/ before <a, o> (there appear to be no exceptions 
before <u>): only in clangorous, clangour, hangar 

2) <ng> pronounced /n/ or /nk/ before a consonant letter: see length, 
etc., in the Oddities. 

Word-finally, <n, ge> are always separate graphemes representing separate 
phonemes, with <n> always pronounced /n/ and <ge> usually pronounced 
/d3/ - but this is a small set: arrange, change, grange, mange, range, 
strange; flange, orange, phalange; challenge, revenge, scavenge; cringe, 
fringe, hinge, singe, swinge, tinge, whinge; sponge; lounge, scrounge; lunge, 
plunge. To avoid confusion with singing, swinging, the verbs singe, swinge 
retain the <e> before <-ing>: singeing, swingeing, as does spongeing to 
avoid the mispronunciation that might arise from *sponging. Exceptions: 

1) with final <n, ge> pronounced /n3/: only in melange 
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2) with final <n, g, e> as three separate graphemes: only in conge /'konzet1/ 
(sometimes spelt even within English text with French <é>). 


N.B. For once, one with their initial but unwritten /w/ see the entry for <o> 
in chapter 10, section 10.27; and for all the graphemes beginning <oi> 
which have correspondences beginning with consonant phoneme /w/ (<oi, 
oir, oire, ois>) see the entry for <oi> in chapter 10, section 10.29. 


9.25 <p, pp> 


N.B. <ph> has a separate entry. 


THE MAIN SYSTEM 


Only phoneme /p/ 100% e.g. apt, apple 
THE REST 
pronounced 
Exceptions to main (none) 
system 
Word-final doubled <ppe> /p/ only in grippe, steppe 


letter + <e> 


Oddities <pb> /b/ only in Campbell, cupboard, raspberry 
/’kemboal, 'kabod,'ra:zbri:/ 


<pe> /p/ only in canteloupe, troupe 
/‘keentalurp, trurp/ (contrast canape, 
recipe /‘keanapet, 'resipi:/). See Notes 


<pn> /n/ only word-initial and only in words derived 
from Greek trvebua pneuma (‘breath’) or 
TIvVevUWv pneumon (‘lung’), e.g. pneumatic, 
pneumonia 


<pph> /f/ only in sapphic, sapphire, Sappho 
/'sefik, 'seefata, 'sefau/ 


<ps>  /s/ only word-initial and only in some 
words of mainly Greek origin, e.g. psalm, 
psalter, psephology, pseud(o) and many 
compounds, psionic, psittacosis, psoriasis, 
psych(e/o) and many compounds, and a 
few more very rare words. /p/ surfaces in 
metempsychosis - see section 7.2 


<pt> 


2-phoneme graphemes (none) 


NOTES 
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/t/ 


only in Deptford, ptarmigan, pterodactyl 
(Greek, = ‘wing finger’), pterosaur (Greek, 
= ‘wing lizard’), Ptolem-y/aic, ptomaine, 
receipt and a few more very rare words. 
/p/ surfaces in archaeopteryx, helicopter, 
reception, receptive - see section 7.2 


In the vast majority of cases of word-final <p, e> the <e> is part of a 
split digraph (except canape (sometimes spelt even within English text with 
French <é>), recipe) and the <p> is a separate grapheme (including in 


canape, recipe). 


For <pa> in comparable, separate /'seprat/ (adjective), separatist, 
<pe> in deepening, desperate, halfpenny, opening, operable, operative, 
prosperous, temperament, temperature, twopenny, <pi> in aspirin, <po> 
in corporal, corporate, policeman pronounced /'pli:sman/, temporary see 


section 6.10. 
9.26 <ph> 


THE MAIN SYSTEM 


Basic phoneme _/f/ 


THE REST 


Exceptions to main 


system 
<ph> 
<ph> 
Oddities <phth> 


99% 


pronounced 


/p/ 


/v/ 


/t/ 


e.g. philosophy and many other words 
mainly of Greek origin 


<1% in total 


only in diphtheria, diphthong, naphtha, 
ophthalmic, shepherd. The first four also 
have pronunciations with /f/ - e.g. /'dif@pn/ 
versus /'dip@p0n/ 


only in nephew pronounced /'nevju:/ (also 
pronounced /'nefju:/), Stephen 


only in phthisic, phthisis pronounced 
/‘tarstk, 'tarsts/ 
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<phth> /6/ 
2-phoneme (none) 
graphemes 
NOTE 


only in apophthegm /'zpa8em/, phthalate 
/'‘Ozlert/ 


<p, h> are separate graphemes only at morpheme boundaries in compound 
words, e.g. cuphook, tophat. And <ph, th> are separate graphemes in some 


of the words listed just above. 


N.B. For <pp> see under <p>. 
9.27 <q> 


THE MAIN SYSTEM 


Only phoneme /k/ 100% 


THE REST 
pronounced 
Doubled letter (does 
not 
occur) 


Exceptions to main (none) 
system 


Oddities 


<qu> only /k/ 
(not /kw/) 


e.g. quick 


For percentages see Note 


occurs initially or medially (never 
finally) in about 46 words mainly 

of French origin, namely bouquet, 
conquer (/w/ surfaces in conquest - 
see section 7.2), Coquette, croquet, 
croquette, etiquette, exchequer, liqueur, 
liquor, liquorice, maquis, mannequin, 
marquee, marquetry, masquerade, 
mosquito, parquet, piquant, quatrefoil, 
quay, quenelle, quiche, so(u)briquet, 
tourniquet, and, in more conservative 


2-phoneme 
graphemes 


<que> 


(none) 
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as a trigraph 
pronounced 
only /k/ (not 
/kw/ plus 
vowel) 


speakers’ accents, questionnaire, quoits; 
medially also in applique, communique, 
manque, risque where the final <-e> 

is a separate grapheme (sometimes 
written even within English text as 
French <é>), unlike the words in the 
next paragraph; also phonemically 

but not orthographically word-final 

in opaque; claque, plaque; antique, 
bezique, boutique, clique, critique, 
mystique, oblique, physique, pique, 
technique, unique; toque; peruque; and 
a few more rare words where the final 
<e> is part of a split digraph with a 
preceding vowel letter spelling variously 
/@I, at, ix, au, ur/ 


occurs word-initially only in queue and 
medially only in milquetoast (where it is 
nevertheless stem-final in a compound 
word); otherwise only word-finally and 
only in about 18 words mainly of French 
origin, namely: 

(1) with a preceding consonant 

letter such that <que> could be 
replaced by <k> without changing 

the pronunciation: arabesque, barque, 
basque, brusque pronounced /brask/ 
(also pronounced/bru:sk/), burlesque, 
casque, catafalque, grotesque, 

marque, masque, mosque, picturesque, 
romanesque, statuesque, torque. 
However, in this group barque, basque, 
casque, marque, masque, torque are 
kept visually distinct from bark, bask, 
cask, mark, mask, torc; 

(2) with a preceding vowel letter with a 
short pronunciation such that <que> 
could be replaced by <ck> without 
changing the pronunciation: baroque, 
cheque (cf. US check), monocoque, 
plaque pronounced /plek/ (also 
pronounced /pla:k/) 
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NOTE 


Gontijo et al. (2003) do not recognise <que> as a separate grapheme. 
However, their calculations show that <qu, que> pronounced /k/ together 
constitute 9% of pronunciations of <qu> and that the other 91% of 
occurrences of <qu> are pronounced /kw/. 


9.28 <r, rr> 


Never word-final as separate graphemes. 


THE MAIN SYSTEM 

Only phoneme /r/ 100% e.g. very, berry 

THE REST 

pronounced 

Exceptions to main (none) 

system 

Word-final doubled <rre> /r/ occurs only in barre, bizarre, parterre, 

letter + <e> where it forms part of the four-letter 
graphemes <arre, erre> and is not 
pronounced /r/ (except that <rr> 
represents /r/ after /r/-linking in bizarrery 
- see section 3.6) 

Oddities <re> /a/ 100% of pronunciations of word-final <re>. 


Only word-final, and in that position almost 
entirely regular, e.g. centre, mitre. The only 
exceptions appear to be genre, macabre 
/'3onra, ma'karbra/, where <r, e> are 
separate graphemes representing separate 


phonemes 
<re> /r/ only in forehead pronounced /'forid/ 
<rh> /r/ only in words of Greek origin, e.g. 


rhinoceros, rhododendron. There are some 
2-phoneme exceptions at morpheme 
boundaries, e.g. poorhouse, warhorse 


<rrh> /r/ only medially and only in a few words 
of Greek origin, namely amenorrhoea, 
arrhythmia, cirrhosis, diarrhoea, 
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2-phoneme (none) 


graphemes 


NOTE 


gonorrhoea, haemorrhage, haemorrhoid, 
lactorrhoea, pyorrhoea, pyrrhic. N.B. In 
catarrh, myrrh <rrh> is not a separate 
grapheme, but forms part of the four- 
letter graphemes <arrh, yrrh> and is 
not pronounced /r/ (but in catarrhal 
/r/-linking occurs - see section 3.6) 


For full treatment of /r/-linking, implying when stem-final <r> is and is not 


pronounced, see section 3.6. 


9.29 <s, ss> 


N.B. <se, sh, si, ssi> have separate entries. 


THE MAIN SYSTEM 

Basic phoneme /s/ <s> 56%, 
<ss> 89% 

Other phonemes /z/ 43% 


for <s> 


e.g. cats, grass. For <s>, except 
within split digraphs, /s/ is regular 

in all positions, including when 

<s> is a grammatical suffix ora 
contracted form after voiceless non- 
sibilant consonants. Only exceptions 
in word-initial position: sorbet 
(sometimes), sugar, sure and German 
pronunciations of sauerkraut, spiel, 
Stein, strafe, stumm. For medial and 
final positions see Notes and Table 9.5. 
For <ss> see the exceptions to the 
main system, and <ssi>, section 9.32 


e.g. dogs. Never word-initial (except 
in sorbet pronounced /'z>:bet/ (also 
pronounced /'sd:be1/) and German 
pronunciation of sauerkraut). Regular 
within split digraphs, and when <s> 
is a grammatical suffix or a contracted 
form after stem-final vowels and 


314 Dictionary of the British English Spelling System 


/3/ <1% 

THE REST 
pronounced 

Exceptions to main system 

<s> /J/ 

<ss> /J/ 

<ss> /z/ 
Word-final doubled <sse> /s/ 


letter + <e> 


voiced non-sibilant consonants. For 
final position otherwise and medial 
position, see Notes and Table 9.5 


always preceded by a vowel letter and 
followed by <ua, ur>; only medial and 
only in casual, sensual, usual, visual: 
(dis/en/fore-)closure, com/ex-posure, 
embrasure, erasure, leisure, measure, 
pleasure, treasure(n, treasury, 
usur-y/er/ious. Despite its rarity in 
the grapheme-phoneme direction, 
this correspondence belongs in the 
main system because of its status as 
a main-system correspondence in the 
phoneme-grapheme direction - see 
section 3.8.4 


See also Table 9.5 


<1% of pronunciations of <s>. Only 

in (initially) sugar, sure, and German 
pronunciations of spiel, stein, strafe, stumm; 
(medially) asphalt pronounced /'zJfelt/ 
(also pronounced /'xsfelt/), censure, 
commensurate, ensure, insure, tonsure 


7% of pronunciations of <ss>. Only in assure, 
fissure, issue, pressure, tissue 


5% of pronunciations of <ss>. Only in Aussie, 
brassiere, dessert, dissolve (but contrast 
dissolution, with /s/), hussar, Missouri, 
possess (first <ss>), scissors 


except in divertissement, only word-final, 
e.g. bouillabaisse, crevasse, duchesse, 
finesse, fosse, impasse, lacrosse, largesse, 
mousse, noblesse, palliasse, wrasse and a 
few more rare words (and contrast retrousse 
/ra'trurser/, sometimes spelt even within 
English text with French <é>) 
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Oddities <sc> /s/ 98% of pronunciations of <sc>, but see 
Notes. Regular before <e, i, y>, e.g. ascend, 
disciple, scythe. \rregularly, also in corpuscle, 
muscle; /k/ surfaces in corpuscular, 
muscular - see section 7.2. Exception: 
Sceptic, with /sk/, which is also the regular 
pronunciation (following the general rules 
for <s, c>) before <a, 0, u> (corpuscle, 
muscle appear to be the only occurrences of 
<sc> before a consonant letter). For other 
exceptions see next 2 paragraphs 


<sc> /f/ 1% of pronunciations of <sc>. Only in 
conscie, conscientious, crescendo, fascis-m/t 


<sc> /z/ <1% of pronunciations of <sc>. Only 
in crescent pronounced /'krezant/ (also 
pronounced /'kresant/) 


<sce> /s/ only word-finally in verbs ending <-esce>, 
e.g. acquiesce, coalesce, convalesce, 
deliquesce, effervesce, evanesce and some 
other very rare words, plus reminisce. The 
final <e> surfaces as /a/ in some suffixes, 
e.g. convalescent - see section 7.2 


<sch> /J/ only in maraschino, meerschaum, schedule, 
schemozzle, schist, schistosomiasis, schlemiel, 
schlep, schlock, schmaltz, schmo(e), 
schmooze, schnapps, schnauzer, schnitzel, 
Schnozzle, schuss, schwa, seneschal. Except 
in these words and schism (next paragraph) 
and in a few cases across a morpheme 
boundary (discharge, escheat, eschew, 
mischance, mischief, mischievous, with /stf/), 
<s, ch> is always pronounced /sk/, e.g. 
school. For absence of percentages here and 
in next paragraph see Notes 


<sch> /s/ only in schism pronounced /'stzam/ 


<sci> /J/ only in conscience, conscious, fascia, luscious 
/'konfans,'konfas, 'ferfa, ‘lAfas/ 


<sj> /J/ only in sjambok /'fzambok/ 


<st> /s/ regular before final <-en, -le>, e.g. 
chasten, christen, hasten, fasten, glisten, 
listen, moisten (exception: tungsten); castle, 
forecastle (whether pronounced /'fauksal/ or 
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2-phoneme grapheme 


NOTES 


<sth> /s/ 


<sw> /s/ 


<s> 


/1z/ 


/'fa:karsal/), nestle, pestle, trestle, wrestle, 
bristle, Entwistle, epistle, gristle, thistle, 
whistle, apostle, jostle, throstle, bustle, hustle, 
rustle; otherwise only in chestnut, Christmas, 
durstn’t, dustbin, dustman, mistletoe, 
mustn’t, ostler, Postlethwaite, Thistlethwaite, 
Twistleton, waistcoat pronounced /'weiskaut/ 
and sometimes ghastly. /t/ surfaces in 
apostolic, epistolary - see section 7.2 


only in asthma, isthmus if pronounced 
without /6/ 


only in answer, coxswain, sword /'a:nsa, 
‘koksan, sd:d/ and boatswain pronounced 
/‘bausan/ (also pronounced /'bautswein/) 


only, following an apostrophe, in regular 
singular and irregular plural possessive forms 
after a sibilant consonant (/s, z, J, 3, tf, d3/), 
e.g. Brooks’s (book), jazz’s (appeal), Bush’s 
(government), (the) mirage’s (appearance), 
(the) Church’s (mission), (the) village’s 
(centre), (the) geese’s (cackling) 


Given that /s/ is the regular pronunciation of medial <s>, Table 9.5 


lists categories where medial <s> is instead pronounced /z/, plus sub- 


exceptions with /s/ (and a very few sub-sub-exceptions with /z/). 


And given that /s/ is the regular pronunciation of word-final <s> 


(including when it is a grammatical suffix or contracted form after a voiceless 


non-sibilant consonant), here is a list of categories where word-final <s> 
is instead pronounced: 


+ /z/ 


1) regularly after vowels and voiced non-sibilant consonants when 


<s> is a grammatical suffix (regular noun plural and third person 


singular present tense verb and, following an apostrophe, regular 


singular and irregular plural possessive) or contracted from is, has. 


This includes plurals in <-es> pronounced /i:z/ of words of Greek 


and Latin origin which have singulars in <-is> pronounced /1s/, 


e.g. axes, crises, diagnoses, testes 


2) in a few function words: always, as, his, sans, and cos where this is 


the abbreviation of because 
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3) plus a few content words: lens, missus, and series, species (whether 
singular or plural), plus cos, the lettuce and the abbreviation of 
cosine, which vary in pronunciation between /koz/ and /kps/ 

/1z/ - see the 2-phoneme pronunciation above. 

For <(s)sa> in adversary, emissary, necessary, <(s)so> in promissory, 
reasonable, seasoning, <ste> in christening, listener, listening see section 
6.10. 

The percentages of /f, z/ as pronunciations of <ss> are due solely to the 
high frequencies of a few words with these correspondences. 

The percentages for <sc> depend on recognising it as a digraph rather 
than as two single-letter graphemes. However, the fact that it is mainly a 
digraph before <e, i, y> and hardly ever a digraph elsewhere helps with this. 

Gontijo et al. (2003) state that /s/ accounts for 96% of pronunciations of 
<sch> and /Jf/ for only 4% - but since <sch> pronounced /s/ occurs only in 
schism their corpus must have been very strange in this respect. 


TABLE 9.5: MEDIAL <s> PRONOUNCED /z/, WITH SUB-EXCEPTIONS PRONOUNCED 
/s/ AND SUB-SUB-EXCEPTIONS PRONOUNCED /2z/. 


For other exceptions see above. 


Categories where medial <s> is 
exceptionally pronounced /z/ 


Sub-exceptions where medial <s> is 
pronounced /s/ (with a few sub-sub- 
exceptions with /z/) 


Almost always before <b> and always 
before <d, g, |, m>), but except 

before <m>, where there are hundreds 
of examples (e.g. chasm, prism, 
seismic, talisman), this is a small set: 
asbestos, busby, husband, lesbian, 
presbyter, presbyterian pronounced 
/prezbr'ttari:jan/, raspberry (taking 
<pb> to be a spelling of /b/); Tuesday, 


Wednesday, Thursday, wisdom; phosgene; 


gosling, grisly, Islam, measles, measly, 
muslim, muslin, Oslo (but the Norwegian 
pronunciation has /s/), quisling 


only in presbyterian pronounced 
/prespr'trarisjan/, where the <b> also 
devoices, unusually, to /p/ 


Mostly after <m>, e.g. crimson, flimsy, hamster 
helmsman, whimsical, whimsy 
Mostly after <w>, e.g. blowsy, drowsy, frowsty 


frowsy 


318 Dictionary of the British English Spelling System 


TABLE 9.5: MEDIAL <s> PRONOUNCED /z/, WITH SUB-EXCEPTIONS PRONOUNCED 
/s/ AND SUB-SUB-EXCEPTIONS PRONOUNCED /z/, CONT. 


Categories where medial <s> is 
exceptionally pronounced /z/ 


Sub-exceptions where medial <s> is 
pronounced /s/ (with a few sub-sub- 
exceptions with /z/) 


In the prefix <trans-> where the following 
phoneme is a vowel or a voiced consonant, 
e.g. transact, transgress, transit(ion), 
translate, transmit, transmute 


transitive, transom 


Mostly between vowel letters 


Where the following letter is <e, i> followed 
by another vowel letter - see the main entries 
for <se, si>; 

In compounds, e.g. aforesaid, antiseptic, 
beside, research, 

Always in the endings <-osity, -sive, -some>; 
Mostly in the ending <-sy> (sub-sub- 
exceptions with /z/: busy, cosy, daisy, poesy, 
posy, queasy, and derived forms such as 
cheesy, easy, lousy (despite the /s/ in louse - 
see Notes to next section), noisy, nosy, prosy, 
rosy); 

In prefix <dis-> (sub-sub-exceptions with 
/z/: disaster, disease); 

In prefix <mis->; 

In a set of Greek words ending <-sis> in 
singular and <-ses> in plural: analysis, basis, 
crisis, diagnosis, emphasis, oasis, prognosis, 
thesis: 

Plus asylum, basin, bison, chrysalis, 
comparison, crusade, desecrate, desolate, 
desultory, dysentery, episode, gasoline, 
garrison, isolate, isosceles and other words 
beginning <iso->, kerosene, mason, 
nuisance, palisade, parasite, parasol, 
philosophy, prosecute, sausage, unison and 
sometimes venison 

In the ‘sugar’ words dextrose, glucose, 
lactose, sucrose the ending <-ose> can be 
pronounced /aus/ or /auz/ and this may also 
be true of many of the (mostly rare) adjectives 
ending in <-ose> - but morose, verbose (at 
least) have only /aus/ 


In a few other odd words: absolve, absorb, 
absorption, bowser, geyser, hawser, observe, 


palsy, pansy, tansy 
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9.30 <se> 


Never initial. 


THE MAIN SYSTEM 


For both categories see Notes and Table 9.6. For the absence of percentages 


see Notes. 
Basic phoneme _ /s/ only word-final. Regular after a consonant 
letter; otherwise unpredictable 
Other phoneme /z/ only word-final. Regular (no exceptions) after 
<ai, au, ui>, but this covers only 10 words; 
otherwise unpredictable 
THE REST 
pronounced 
Exceptions to main (none) 
system 
Oddities (N.B. All medial, therefore not classified as 
exceptions to main system) 
<se> /S/ only in gaseous pronounced /'getfas/ (also 
pronounced /'gez'si:jas/) 
<se> /z/ only in gooseberry /'guzbri:/, housewife 
‘sewing kit’ pronounced /‘hazif/ 
<se> /3/ only in nausea, nauseous pronounced 
/'nd:3a(s)/ (also pronounced /'nd:zizja(s)/) 
2-phoneme (none) 
graphemes 
NOTES 


Gontijo et al. (2003) do not recognise <se> as a separate grapheme, hence 
the absence of percentages. | have based my choice of /s/ as the basic 
phoneme for <se> on its predominance in Table 9.6. This is admittedly a 
sort of lexical, rather than a text, frequency (see section 3.3). 

Initial <s, e> and (except in the few Oddities listed) medial <s, e> always 
are/ belong to separate graphemes. Word-finally, the only words in which 
<sS, e> are separate single-letter graphemes appear to be tsetse, usually 
pronounced /'tetsi:/ and the three French loanwords blase, expose (‘report 
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of scandal’) and rose (‘pink wine’), with /e1/ (increasingly spelt even within 
English text with French <é>). In almost all other cases of final <s, e> the 
<e> is part of a split digraph and the <s> is a single-letter grapheme - 
see previous section. Part of my definition of a split digraph (see section 
A.6 in Appendix A) is that the leading letter is not preceded by another 
vowel letter. This makes it easy to define and identify almost all the words 
ending <se> where these letters do form a digraph, namely those where 
<-se> is preceded by two vowel letters or a consonant letter: see again 
Table 9.6, which also distinguishes the relevant words according to /s, z/ 
pronunciations. 

In the last row of the table are listed the only eight words in which the 
vowel letter before the <s> is a single vowel letter preceded in turn by a 
consonant letter, so that that vowel letter and the final <e> look as though 
they ought to form a split digraph, but do not; these are the only exceptions 
to my definition of grapheme <se> just above besides the four words listed 
earlier in the previous paragraph. 

Given that the pronunciation of houseas averbis /hauz/, the pronunciation 
of houses /‘hauziz/ as a singular verb is regular, but as a plural noun shows 
avery rare irregularity: if it were regular it would be /‘hausi1z/ (the noun stem 
/haus/ plus the plural ending /1z/ which is regular after sibilant consonants). 
The voicing of the stem-final consonant is shared only with some words 
ending in /f/ in the singular but /vz/ in the plural, e.g. leaf/leaves, or in /0/ 
in the singular but /6z/ in the plural (in RP), e.g. bath(s), plus lousy with /z/ 
from louse with /s/ (and contrast mous(e)y with /s/). 


TABLE 9.6: /s, z/ AS PRONUNCIATIONS OF WORD-FINAL <se>. 
/s/ /z/ 


After <ai, au, ui> (none) all, but this is a small set: 


appraise, braise, chaise, 
praise, applause, cause, 
clause, pause; bruise, cruise 


After <ea, ee, oi, 00, OU, U> | Cease, crease, decease, appease, ease, please, tease; 
decrease, grease, cheese; noise, poise; choose; 
increase, lease, release; | arouse, blouse, carouse, 
geese; porpoise, tortoise; | espouse, rouse, plus house 
goose, loose, moose, /hauz/ as a verb and 


noose, vamoose; douse, (suffixed) houses /'hauziz/ 
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grouse, louse, mouse, 
Scouse, souse, spouse, 
plus house /haus/ as 
a singular noun (see 
Notes); use (noun) 


as a plural noun and 
singular verb (see Notes); 
fuse, muse, use (verb) 


After <r, w> (which here 
always form part of a vowel 
digraph) 


all except those shown 
on right, including 
dowse (/daus/ ‘splash 
with water’, variant 
spelling of douse) 


only in parse; hawse, tawse; 
browse, dowse (/dauz/ 
‘detect water’), drowse 


After any other consonant 
letter 


all except cleanse 


only in cleanse 


After consonant + vowel, so 
looking as though there is a 
split digraph 


all, but this is a small 
set because final <e> 
after <s> is normally 
part of a split digraph 
(see Notes above Table 
and previous section): 
carcase, purchase; 
diocese /‘datastis/; 
mortise, practise, 
premise, promise, 
treatise; purpose 


(none) 


For <se> in arsenal, arsenic see section 6.10. 


9.31 <sh> 


Only phoneme /S/ 100% e.g. ship, fish 


NOTE 


The only cases where, exceptionally, <s, h> do not form a digraph but 
belong to separate graphemes are at morpheme boundaries in compound 
words, e.g. mishandle, mishap, mishit. \n dishonest, dishonour, however, 
there is no /h/ phoneme, so the letter <h> is (according to your analysis) 
either ‘silent’ or part of a grapheme <ho> pronounced /v/. | prefer the 
latter analysis - see /p/, section 5.4.4, and <h>, section 9.16. 
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9.32 <si> 


Only medial. 


THE MAIN SYSTEM 


Basic phoneme 


Other phoneme 


/3/ 


/S/ 


55% 


45% 


regular when both preceded 

and followed by vowel letters, 
e.g. vision. In all such words 

the stress falls on the vowel 
preceding /3/ spelt <si>, and 
that vowel is always spelt with a 
single letter and has its letter- 
name pronunciation, e.g. evasion, 
cohesion, erosion, collusion, except 
that <i> is always short /1/, e.g. 
collision. See Notes 


regular between a preceding 
consonant letter (which is always 
one of <I, n, r>) and a following 
vowel letter, e.g. emulsion, 
repulsion; pension, tension; 
aversion, controversial, excursion, 
reversion, torsion, version. In all 
these cases the stress falls on the 
vowel preceding <I, n, r>. Also, 
where the preceding consonant 
letter is <I, n> the preceding 
vowel is spelt with a single letter 
which has its ‘short’ pronunciation; 
where the consonant letter is <r> 
it forms a digraph with the vowel 
letter and the pronunciation is 
either /3:/ where the digraph 

is <er, ur> or /d:/ where it is 
<or> (there are no words ending 
<-arsion, -irsion>). See Notes 
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THE REST 
pronounced 
Exception to main medial <si> /z/ only in business. See also section 6.10 
system 
Oddities (none) 


2-phoneme graphemes (none) 


NOTES 


<s, i> never form a digraph word-initially or -finally; medially they form a 
digraph only when followed by stem-final <-on>, plus business, controversial. 

Given that the contexts in which the two pronunciations occur are almost 
entirely distinct, <si> is almost 100% predictable. The only exception is that 
version is now often pronounced /'v313an/ rather than /'v3:fan/. 


N.B. For <ss> see under <s>. 


9.33 <ssi> 


Only medial. 


Only phoneme ify 100% regular when both preceded and 
followed by vowel letters, e.g. 
accession, admission, discussion, 
fission, intercession, obsession, 
passion, percussion, permission, 
recession, remission. Exception: 
dossier, in either pronunciation 
(/‘dosizja, 'dosixjer/). In all 
these cases, including dossier, 
the stress falls on the vowel 
preceding/J/ spelt <ssi>, and 
that vowel is spelt with a single 
letter which has its ‘short’ 
pronunciation 


NOTE 


In all other cases, <ss, i> are/belong to separate graphemes, e.g. in missile, 
passive. 
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9.34 <t, tt> 


N.B. <tch, th, ti> have separate entries. 


Basic /t/ <t>94%, e.g. rat, rattle 

phoneme <tt> 100% 

Other /tf/ 2% of regular before <u> followed 
phoneme for pronunciations by either another vowel letter 
<t> of <t> or a single consonant letter 


and then a vowel letter, e.g. 

(in initial position) tuba, tube, 
tuber, Tuesday pronounced 
/‘t{urzdi:/, tuition pronounced 
/t{u:'wifan/, tulip, tumour, 
tumult, tumultuous, tumulus, 
tuna, tune pronounced /'tf{urn/, 
tunic, tureen, tutor, (medially) 
impromptu; gargantuan, 
perpetuate, attitude, multitude, 
solitude; statue, virtue; habitue; 
intuition, pituitary, costume; 
fortunate, fortune, importune, 
opportune; capture, mature 
and dozens of other words in 
<-ture> and derivatives such 
as adventurous(ly), natural(ly); 
centurion, century, saturate; 
virtuoso; obtuse; 
de/in/pro/re/sub-stitution; 
also in several groups where 
the stress is always on the 
syllable preceding /tf{/ spelt 
<t>: actually), perpetually), 
virtuaKly) and several other 
words in <-tual(ly)>; actuary, 
estuary, mortuary, obituary, 


THE REST 


Exceptions to 
main system 


<t> 


<t> 


pronounced 


/f/ 


/s/ 
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sanctuary, statuary, voluptuary, 
congratulate, fistula, petulan-t/ 
ce, postulant, postulate, spatula, 
titular, contemptuous, fatuous, 
impetuous, tempestuous, 
tumultuous (again) and several 
other words in <-tuous>. 
Though rare in this direction, 
this correspondence qualifies 
as part of the main system 
because of the high frequency 
and predictability of /t{/ spelt 
<t> - see section 3.7.2 


5% of pronunciations of <t>. Mainly 
before <iat> with the <i> pronounced 
/it/, e.g. differentiate, expatiate, 
ingratiate, initiate, negotiate, 
propitiate, satiate, substantiate, vitiate, 
plus minutiae, otiose pronounced 
/‘avfitjaus, 'aufizjauz/ (also 
pronounced /'avutizjaus, 'auti:jauz/), 
partiality, ratio. Partial exceptions: 
novitiate can be pronounced with or 
without the /i:/: /na'vifisjat, na'vifat/ 
and can therefore follow either this 
rule or the main rule for <ti>, see 
section 9.37; also, some of the words 
listed have alternative pronunciations 
with /s/, e.g. negotiate, substantiate 
as either /ni'gaufiijert, sab'steenfisjert/ 
or /nt'gausi:jert, sab'staensi:jert/. See 
also next paragraph 


<1% of pronunciations of <t>. Only 
the penultimate <t> in about 10 
words ending in <-tiation>, e.g. 
differentiation, initiation, negotiation, 
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Word-final 
doubled letter 
+ <e> 


Oddities 


<tte> 


<te> 


/t/ 


/t/ 


propitiation, transubstantiation, and 
only for RP-speakers who avoid having 
two occurrences of medial /J/ in such 
words (see Notes under /J/, section 
3.7.3), plus a few words where <t> 

is alternatively pronounced /J/ - see 
previous paragraph. In French, on the 
other hand, /s/ is one of the most 
frequent correspondences for <t> 


only word-final, e.g. cigarette, 
gavotte. All such words have stress 
on the syllable ending in /t/ spelt 
<-tte> except etiquette, omelette, 
palette, which have stress on the first 
syllable. In latte <tt, e> are separate 
graphemes, as are <u.e, tt> in butte 


mainly word-final and in that position 
in at least 120 words, namely 
- Bacchante, composite, compote, 
confidante, cote, debutante, definite, 
detente, dirigiste, enceinte, entente, 
entracte, exquisite, favourite, granite, 
hypocrite, infinite, minute (‘sixtieth 

of an hour’), opposite, perquisite, 
plebiscite, pointe, requisite, riposte, 
route, svelte 
- about 30 nouns/adjectives in <-ate> 
pronounced /at/ where the verbs with 
the same spelling are pronounced 
with /ert/, e.g. advocate, affiliate, 
aggregate, alternate (here with alsoa 
difference in stress and vowel pattern: 
noun/adjective pronounced /9:I't3:nat/, 
verb pronounced /'s:ltaneit/), animate, 
appropriate, approximate, articulate, 
associate, certificate, coordinate, 
curate (here with also a difference 

in meaning and stress: noun (‘junior 
cleric’) pronounced /'kjuarat/, verb 
(‘mount an exhibition’) pronounced 
/kjua'reit/), degenerate, delegate, 
deliberate (here with also a difference 
in syllable structure: adjective 
/dr'ltbrat/ with three syllables 
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and an elided vowel - see section 6.10; 
verb /dr'lrbareit/ with four syllables), 
designate, desolate, duplicate, 
elaborate, estimate, expatriate, 
graduate, initiate, intimate, legitimate, 
moderate, pontificate (here with 
unrelated (?) meanings: noun (‘pope’s 
reign’) pronounced /pon'trfiket/, verb 
(‘speak pompously’) pronounced 
/pon'tiftkert/), precipitate (but here 
only the adjective has /at/; the noun as 
well as the verb has /ert/), predicate, 
separate (here too with a difference in 
syllable structure: adjective /'seprat/ 
with two syllables and an elided vowel 
- see section 6.10; verb /'separert/ with 
three syllables), subordinate, syndicate, 
triplicate. In the verbs and the many 
other nouns and adjectives with this 
ending pronounced /ert/, <e> is part 
of the split digraph <a.e> pronounced 
/et/ and the <t> on its own is 
pronounced /t/ 
- a further set of at least 60 nouns/ 
adjectives in <-ate> pronounced 

/at/ with no identically—-spelt verb, 

e.g. accurate, adequate, agate, 
appellate, celibate, climate, collegiate, 
conglomerate, (in)considerate, 
consulate, consummate, delicate, 
desperate, (in)determinate, directorate, 
disconsolate, doctorate, electorate, 
episcopate, extortionate, fortunate, 
illegitimate, immaculate, immediate, 
inanimate, in(sub)ordinate, inspectorate, 
intricate, inviolate, (bacca)laureate, 
legate, (illiterate, novitiate, obdurate, 
palate, particulate, (com/dis) passionate, 
private, profligate, proletariate, 

(dis) proportionate, protectorate, 
proximate, roseate, senate, surrogate, 
(in)temperate, triumvirate, ultimate, 
(in) vertebrate (a few of these words 
have related verb forms with <-ate> 
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<te> 
<ts> 
<tsch> 
<tw> 
2-phoneme (none) 
graphemes 
NOTES 


/tf/ 
/z/ 
/tf/ 
/t/ 


pronounced /eit/: animate, legitimate, 
mediate, subordinate, violate) 
- possibly just one word where 

both noun and verb have <-ate> 
pronounced /at/: pirate 
-pronounced also occurs medially 

in a few words in rapid speech, e.g. 
interest, literacy, literal, literary, 
literature, sweetener, veterinary - see 
section 6.10. 

In all cases where is pronounced the 
is phonographically redundant, but in 
a couple it makes the words visually 
distinct from words without the and 
with an unrelated meaning: point, rout. 
Exceptions where word-final <t, e> 
are separate graphemes: coyote, 
dilettante, (piano)forte, karate, 
machete /ma'feti:/, and the French 
loanwords diamante, naivete, pate 
(‘paste’), saute (sometimes spelt even 
within English text with French <é>) 


only in righteous 
only in tsar 
only in kitsch, putsch 


only in two and derivatives, e.g. 
twopence, twopenny. /w/ surfaces 
in between, betwixt, twain, twelfth, 
twelve, twenty, twice, twilight, twilit, 
twin - see section 7.2 


For <ta> in budgetary, commentary, dietary, dignitary, fragmentary, 


hereditary, military, momentary, monetary, pituitary, planetary, proprietary, 


salutary, sanitary, secretary pronounced /'sekratri:/, sedentary pronounced 
/‘sedantri:/ (also pronounced /si'dentari:/), solitary, tributary, unitary, 
voluntary, <tau> in restaurant, <(t)te> in cemetery, dysentery, entering, 


et cetera, interest, literacy, literal, literature, literary /'‘\ttrari:/, monastery, 
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mystery /'mistri:/, presbytery, sweetener, veterinary /'vetrinri:/, utterance, 
<to> in amatory, auditory, conciliatory, conservatory, contributory, 
declamatory, defamatory, de/ex/re/sup-pository, desultory, dilatory, 
dormitory, explanatory, exploratory, factory /'feektri:/, history /‘htstri:/, 
inflammatory, inhibitory, interrogatory, inventory pronounced /‘1nvantri:/ 
(also pronounced /t1n'ventari:/), laboratory, lavatory, mandatory, nugatory, 
obligatory, observatory, offertory, oratory, predatory, preparatory, 
promontory, purgatory, repertory, retaliatory, signatory, statutory, 
territory, transitory, victory /'viktri:/, <tu> in accentual, actual(ly), actuary, 
adventurous(ly), conceptual, contractual, effectual, estuary, eventual, 
factual, habitual, intellectual, mortuary, mutual, naturally), obituary, 
perpetual, punctual, ritual, sanctuary, statuary, spiritual, textual, virtual, 
voluptuary see section 6.10. 

All the words in which <t> is pronounced /tf{/ were formerly pronounced 
with the sequence /tj/, and conservative RP-speakers may still pronounce 
them that way (or imagine they do). Pronunciations with /tj/ would require 
an analysis with the <t> pronounced /t/ and and the /j/-glide as part of 
the pronunciation of the <u> and following <r> or vowel letter. See <d>, 
section 9.11, for the largely parallel correspondence to voiced /d3/, <di> in 
the Oddities there, and <ti>, section 9.37. 


9.35 <tch> 


Only phoneme /t{/ 100% e.g. match 


NOTE 


There appear to be no cases where <t, ch> are separate graphemes. 


9.36 <th> 


THE MAIN SYSTEM 
Basic /0/ 88% in all (content and function) words ending 
phoneme in <-ther>, e.g. brother, either, except 


anther, ether, panther, and in all function 
words (except both, through and Scots 
outwith), i.e. although, than, that, the, 


330 Dictionary of the British English Spelling System 


Other /8/ 
phoneme 


THE REST 


Exceptions to 
main system 


<th> 


<th> 


<th> 


12% 


pronounced 


/t/ 


/tf/ 


thee, their, them, then, thence, there, 
these, they, thine, this, thither, those, thou 
(archaic second person singular pronoun), 
though, thus, thy, with, without; also 

in avery few other stem content words, 
namely algorithm, bequeath, betroth (but 
troth has /@/), booth, brethren, farthing, 
fathom, heathen (but (unrelated) heath 
has /6/), mouth /mavud/ (verb), oath 
/av00/ (verb), rhythm, smithereens, smooth, 
swarthy, withy and derivatives, e.g. 
betrothal, plus some other derived forms: 
earthen, loathsome, norther-n/ly, smithy, 
souther-n/ly, worthy, even though their 
stems earth, loath, north, smith, south, 
worth have /@/. Also, in RP, in plurals 

of some nouns which have /8/ in the 
singular, e.g. baths, oaths, paths, youths 
/bar6z, av6z, par6z, jurdz/ 


in three function words (both, through and 
Scots outwith) and in most content words, 
e.g. anther, ether, methane, method, 
mouth /mau8/ (noun), oath /au8/ (noun), 
panther, pith, thigh, thin, thou (informal 
abbreviation meaning ‘thousandth of an 
inch/thousand pounds/dollars’), threw 


<1% in total 


only in Thai, thali, Thame, Thames, Therese, 
Thomas, thyme, Wrotham /'ru:tam/ 


only in posthumous /'post{fameas/ 


as 2-phoneme only in eighth /ert0/ 


sequence /t0/ 
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Oddities <the> /6/ only in Catherine with first <e> elided (see 
section 6.10), saithe (/se16/, ‘fish of cod family’) 


<the> /8/ only word-final and only in breathe, loathe, 
seethe, sheathe, soothe, staithe, teethe, wreathe. 
Only exceptions: absinthe /zb'sznt/, (the river) 
Lethe /'li:8i:/ (in Greek mythology), nepenthe 
/ne'pen@i:/ 


2-phoneme (see above) 
grapheme 


NOTES 


The communicative load of the /8, 6/ distinction is very low - there are 
remarkably few minimal pairs differing strictly and only in these phonemes; 
even scraping the dictionary for rare words | have managed to identify only 10 
such pairs. The only ones which are also identical in spelling appear to be mouth, 
oath, thou (for the distinctions in use/meaning see above), and the only pairs 
which are not identical in spelling appear to be /o(a)th/loathe, sheath/sheathe, 
teeth/teethe, wreath/wreathe, where the words in each pair are related in 
meaning, plus ether/either pronounced /'i:da/ (also pronounced /'a1da/), 
sooth/soothe, thigh/thy, where they are not. Other pairs differing visually 
only in the absence or presence of final <e> (bath/bathe, breath/breathe, 
cloth/clothe, lath/lathe, swath/swathe) have a further phonological difference 
in the pronunciation of the preceding vowel grapheme; similarly, seeth 
(/’sixj1@/, archaic 3' person singular of see) differs from seethe /sixd/ in 
having two syllables rather than one. 

The only cases where <t, h> do not form a digraph are at morpheme 
boundaries in compound words, e.g. adulthood, bolthole, carthorse, 
coathook, goatherd, hothouse, meathook, pothole, warthog. 

For <tho> in catholic (as well as <the> in Catherine), see section 6.10. 


9.37 <ti> 


Only medial. For all categories see Notes. 


THE MAIN SYSTEM 


Basic phoneme /f/ 94% regular when followed by <a, e, o>, e.g. 
confidential, inertia, infectious, nation, 
quotient, cf. Ignatius 
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THE REST 
pronounced 

Exceptions to <ti> /tf/ 5% Regular when preceded by <s> and 

main system followed by <o>, but occurs only in 
combustion, con/di/indi/in/sug-gestion, 
exhaustion, question, rumbustious, plus 
Christian 

<ti> /3/ <1% only in equation 

Oddities (none) 

2-phoneme (none) 

graphemes 

NOTES 


Given the different contexts in which /J, t{/ occur, these pronunciations are 
almost 100% predictable. 

In all cases other than those defined above, <t, i> are separate graphemes, 
e.g. in consortium pronounced /kan'so:ti:jam/ (also but less often pronounced 
/kan'sa:fem/), till, native; also in a few words which are exceptions to the 
main rule above: cation /'ketatan/, consortia pronounced /kan'so:titja/ (less 
often but, by the main rule above, more regularly pronounced /kan'so:fa/), 
fortieth, otiose, pitiable; and in two words which are sub-exceptions to <ti> 
pronounced /t{/, namely bastion /‘besti:jan/, Christianity /kristix'jenttix/; 
also, the first <ti> is pronounced /si:/ in about 10 words ending in <-tiation>, 
e.g. differentiation, initiation, negotiation, propitiation, transubstantiation, 
but only by RP-speakers who avoid having two occurrences of medial /J/ in 
a word of this sort. See also sections 3.7.3 and 9.35. 


N.B. For <tt> see under <t>. 
N.B. Though <u, u.e> have or are involved in various consonantal 


pronunciations see, nevertheless, the entries for <u, u.e> in chapter 10, 
sections 10.36, 10.38. 


9.38 <v> 


N.B. <ve> has a Separate entry. 


The grapheme-phoneme correspondences, 1 333 


THE MAIN SYSTEM 
Basic phoneme __ /v/ 100% e.g. very, oven 
THE REST 
pronounced 
Exception to main <v> /f/ only in kvetch, svelte, svengali, veldt 
system 
Doubled letter <vw> Iv/ only in bevvy, bower, chavvy, chivvy, 


civyy, divvy, flivver, lavvy, luvv-y/ie, navvy, 
revving, savvy, skivvy, spivvery, spivvy 


Word-final doubled (does not occur) 
letter + <e> 


Oddities (none) 
2-phoneme (none) 
graphemes 

NOTE 


For <vou> in favourable, favourite see section 6.10. 


9.39 <ve> 

Only phoneme /v/ never initial; for medial position see Notes; 
frequent word-finally 

NOTES 


<ve> pronounced /v/ occurs medially in average, deliverable, evening 
(noun, ‘late part of day’, pronounced /‘itvnin/, as distinct from the verb 
of the same spelling, ‘levelling’, pronounced /‘i:vanin/), every, several, 
sovereign (for these words see also section 6.10), and in a large number 
of regular plural nouns and singular verbs, e.g. haves (vs have-nots), gives, 
grieves, initiatives, dissolves, lives (verb), loves, improves, stoves, preserves, 
mauves, gyves; also in a small number of irregular plural nouns ending in 
<-ves> pronounced /vz/ where the singular forms have <-f> pronounced 
/f/, namely calves, dwarves (the form dwarfs also exists), elves, halves, 
hooves, leaves, loaves, scarves, (our/your/them-)selves, sheaves, shelves, 
thieves, turves (the form turfs also exists), wharves, wolves, plus a very few 
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nouns where the <f> in the singular is within the split digraph <i.e>: knives, 
lives (/laivz/; the singular verb of the same spelling is pronounced /I1vz/), 
(ale/good/house/mid-)wives (but if housewife ‘sewing kit’ pronounced 
/‘hazif/ has a plural it is presumably pronounced /'hazifs/). 

In only 33 words, in my analysis (behave, conclave, forgave, gave, shave, 
Khedive, suave, wave; breve, eve; alive, archive, arrive, deprive, drive, five, 
hive, jive, live (adjective, /latv/), naive, ogive, recitative, revive, survive, 
swive, wive; alcove, cove, drove, mangrove, move, prove; gyve) is the <e> 
of final <ve> part not only of that digraph but also of a split digraph with 
a preceding single vowel letter. In practice this makes no difference - the 
word-final phoneme is /v/, so this aspect hardly needs analysing. 

Gontijo et al. (2003) do not recognise <ve> as a separate grapheme. 
However, word-finally and medially before final <s>, <ve> always indicates 
/v/ regardless of whether it is so recognised, so is 100% predictable. Only 
the medial occurrences in average, deliverable, evening (‘late part of day’), 
every, several, sovereign are problematic. 

In other medial occurrences and all initial occurrences <v, e> are 
separate graphemes, e.g. vest, oven. The only word in which final <v, e> 
are separate graphemes appears to be agave /a'gaivit/. 


9.40 <w> 


N.B. (1) <wh> has a separate entry. 
(2) <aw, ew, ow> have separate entries in chapter 10, sections 10.10/ 


21/34. 
THE MAIN SYSTEM 
Basic /w/ 100% e.g. way 
phoneme 
THE REST 

pronounced 

Exceptions to (none) 
main system 
Doubled letter <ww> /w/ only in bowwow, glowworm, powwow, 


slowworm 


Oddity 


2-phoneme 
graphemes 


<wr> 


(none) 


9.41 <wh> 


THE MAIN SYSTEM 
Basic /w/ 
phoneme 
THE REST 


Exceptions to 
main system 


Oddities 


2-phoneme 
graphemes 


NOTES 


<wh> 


(none) 


(none) 
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only in awry (only non-initial example), 
wrap, wrasse, wreck, wren, wrench, wrest(le), 
wretch(ed), wriggle, wring, wrinkle, wrist, 
write, wrong, Wrotham /'ru:tam/, wrought, 
wry and a few more rare words. The only 
words in which <w, r> do not form a digraph 
appear to be cowrie, dowry 


e.g. what, which. See Notes 


20% Only in who, whom, whose, whole, 
whooping), whooper, whore 


The high percentage for <wh> pronounced /h/ is due to the very high 
frequency of who, whose, whole, and recognition of the few words where 
this correspondence obtains should not be problematic. 

Where <wh> is pronounced /w/ in RP, in many Scots accents it is 
pronounced /m/, which is the voiceless counterpart of /w/ and sounds 
roughly like ‘hw’; but because /m/ is nota phoneme of RP this correspondence 


is not included in my analyses. 


The very few cases where <w, h> do not form a digraph are at morpheme 


boundaries in compound words, e.g. sawhorse. 
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9.42 <x> 


THE MAIN SYSTEM 


Basic /ks/ 82% 
pronunciation 


THE REST 


Doubled letter (does not 


occur) 
pronounced 
Exceptions to 
main system 
<x> /[z/ 
<x> /k/ 
<xX> as 2-phoneme 


sequence /gz/ 


e.g. box, next, six 


18% in total 


regular in initial position, e.g. xylophone 
(except that some people pronounce the 
Greek letter name xias /ksat/); medial 
only in anxiety pronounced /xn'zarjrti:/ 
(also pronounced /zn'gzarjiti:/); rare 
word-finally. See Notes 


only in coxswain /'koksan/ and before 
<c> pronounced /s/ ina small group 
of words of Latin origin, namely exceed, 
excellent), except, excerpt, excess, 
excise, excite 


16% Only in some polysyllabic words of 
Latin origin, namely anxiety pronounced 
/en'gzarjiti:/ (also pronounced 
/xen'zarjrti:/), auxiliary, exact, 
exaggerate, exalt, exam(ine), example, 
exasperate, executive, executor, 
exemplar, exemplify, exempt, exert, 
exigency, exiguous, exile pronounced 
/‘egzatl/ rather than /'eksatl/, exist, 
exonerate, exorbitant, exordium, exotic, 
exuberant, exude, exult and a few 

more rare words; also in Alexandra, 
Alexander and becoming frequent in 
exit /‘egzit/ (also pronounced /'eksit/). 
See Notes 
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<x> as 2-phoneme 1% Only in 3 words of Latin origin: 
sequence /kJ/ flexure, luxury, sexual /'flekfa, ‘lakfari:, 
'sekf(urw)al/ 


<xX> as 2-phoneme only in /uxuriance, luxuriant, luxuriate, 
sequence /g3/ luxurious /\ag'3Zuaritj-ans/ant/ert/as/ 


<x> as 3-phoneme only in X-ray, etc. One of only two 
sequence /eks/ 3-phoneme graphemes in the whole 


language 
Oddities (none) 
2-phoneme (in addition to the basic pronunciation and three of the exceptions to 
sequences the main system) 
<xe> /ks/ only in annexe, axe, deluxe 
<xh> /gz/ only in 7 polysyllabic words of Latin 
origin:exhaust(ion), exhibit, exhilarat-e/ 
ion, exhort, exhume 
<xh> /ks/ only in 3 polysyllabic derivatives of 
words in the previous group: exhibition, 
exhortation, exhumation 
<xi> /kJ/ only in anxious, complexion, connexion 
(also spelt connection), crucifixion, 
fluxion, (ob) noxious 
3-phoneme (see above) 
sequence 
NOTES 


In almost all words beginning <ex-> followed by a vowel letter, if the stress 
is on the initial <e>, the <x> is pronounced /ks/, but if the stress is on the 
next vowel the <x> is pronounced /gz/. The only exceptions are exile, which 
is usually pronounced /'egzatl/, i.e. with initial stress but irregular /gz/ 
(though a regularised spelling pronunciation /'eksail/ is sometimes heard); 
exit, which (conversely) is usually pronounced /'eksit/, i.e. with initial stress 
and regular /ks/, but is increasingly heard as (irregular) /‘egzit/, perhaps 
under the influence of exile; and cf. doxology, luxation, proximity with /ks/ 
despite the stress being on the following vowel. This tendency to pronounce 
<x> as /gz/ before the stressed vowel applies also to the given names 
Alexandra, Alexander, but their abbreviated forms Alexa, Alex have /ks/ 
because the stress falls earlier. 
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<x> pronounced /z/ occurs word-initially only in some words of Greek 


origin, namely xanthine, xanthoma, xanthophyll, xenon, xenophobia and 


several other words beginning xeno-, Xerox and several other words 


beginning xero-, xylem, xylene, xylophone and several other words 


beginning xylo-. Word-finally, the plurals of some French loanwords ending 


in <-eau> are sometimes spelt French-style with <x> as well as <s>, e.g. 


beau-s/x, bureau-s/x, flambeau-s/x, gateau-s/x, plateau-s/x, portmanteau 


s/x, trousseau-s/x, indeed, my dictionary gives only the <x> form in 


bandeaux, chateaux, rondeaux, tableaux. |n all these cases <x> is also 


pronounced /z/. In my opinion the <x> form is outmoded and unnecessary. 


For <xo> in inexorable see section 6.10. 


9.43 <z, zz> 


THE MAIN SYSTEM 


Basic phoneme /z/ 


THE REST 


Exceptions to main 
system 


<Z> 


<Z> 


<Z> 


<ZZ> 


<Z> 97%, 
<ZZ> 97% 


pronounced 


/s/ 


/3/ 


as 2-phoneme 
sequence /ts/ 


as 2-phoneme 
sequence /ts/ 


e.g. zoo, dazzle, jazz 


3% of pronunciations of both 
graphemes in total 


only in blitz(krieg), chintz, ersatz, 
glitz, howitzer, kibbutz, kibitz, 

klutz, lutz, pretzel, quartz, ritz, 
schmaltz, schnitzel, seltzer, spritz(en, 
Switzerland, waltz, wurlitzer 


only in azure pronounced /'xz3~a, 'e139/ 
(also pronounced /'xzj(u)a, ‘e1zj(u)e/), 
Seizure /'sit3a/ 


only in Alzheimer’s, bilharzia, nazi 
(but Churchill said /'na:zi:/), scherzo, 
schizo(-) 


only in intermezzo, paparazzi, pizza, 
pizzicato 
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Word-final doubled (does not 


letter + <e> occur) 

Oddities <ze> /z/ only word-final. In other positions 
<Z, e> are separate graphemes, e.g. 
in zest. The only word in which final 
<Z, e> are separate graphemes is 
kamikaze 

<zi> /3/ only in brazier, crozier, glazier 
pronounced /'breiza, 'krauza, 'gle1za/ 
(also pronounced /'breizi:ja, ‘krauzizja, 
‘glerzi:ja/) 

2-phoneme (see above) 

sequences 

NOTE 


The spelling <zh> is also used to represent /3/, but because it occurs only 
in transcriptions of Russian names, e.g. Zhivago, Zhores, | have not added it 
to the inventory of graphemes. 


9.44 Some useful generalisations about 
graphemes beginning with consonant 
letters 


Almost all occurrences of geminate consonant letters are pronounced 
identically to the single letter. (Rule 28 in Clymer, 1963/1996 expresses 
this as ‘When two of the same consonants are side by side, only one is 
heard.’) To experienced users of English this may seem too obvious to 
state, but there are known instances (see, for example, Burton, 2007: 27) 
of adult literacy learners saying, when this was pointed out to them, ‘Why 
did no-one ever tell me that? | thought there must be two sounds because 
there are two letters, and could never work them out’. And | have witnessed 
an 11-year-old boy having to be taught this by his catch-up scheme tutor. 

There are minor exceptions under <gg, Il, ss, zz> among main-system 
graphemes (and a few more under geminate consonants among the rest), 
but the only major set of exceptions is words with <cc> pronounced /ks/ - 
and even here most instances exhibit regular correspondences: the first <c> 
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is pronounced regularly /k/ before a consonant letter, and the second <c> 
is pronounced regularly /s/ before <e, i, y>, e.g. accent, occiput, Coccyx, 
so that here the real irregularities are the few words with <cc> pronounced 
/k/ before <e, i, y>: baccy, biccy, recce (short for reconnoitre), soccer, 
speccy, streptococci, and the two words with <cc> sometimes pronounced 
/s/ before <i>: flaccid, succinct - both have more regular pronunciations 
with /ks/, and there seem to be no such exceptions before <e, y>. This 
generalisation about geminate consonant letters is a very strong rule. 
The five non-geminate doubled consonant graphemes (<ck, dg, dge, 
tch, ve>) and three of the four digraphs with <h> as the second letter 
(<ph, sh, th>) have virtually no irregular pronunciations, even though <th> 
has two major regular ones. <ch> is the exception, with several irregular 
pronunciations. 
In addition, the lists in this chapter reveal two useful context-sensitive 
patterns: 
The six main-system graphemes other than <sh> which are 
pronounced /Jf/, namely <ce, ci, sci, si, ssi, ti>, are fairly easy to 
distinguish from occurrences of these sequences which are not 
pronounced /f/: these graphemes occur with this pronuncation only 
in medial position, and then mainly between two vowel letters, though 
<si> always has a consonant letter between it and the preceding vowel 
letter, and <ci, sci> sometimes have. Also, five of these graphemes (in 
these contexts) have only one pronunciation. (The exception is <ti>, 
where a few words have /t{/ instead, one (equation) has /3/, and in 
a few words with two occurrences of <ti> before a vowel letter (e.g. 
negotiation) there is alternation between /J/ and /s/.) This pattern, 
unlike the next one, would be difficult to formulate as a rule, and 
learners need to pick it up; 
The ‘soft’ pronunciations of <c, g> as /S, d3/ occur in similar contexts 
to each other (before <e, i, y>), and the ‘hard’ pronunciations /k, g/ 
correspondingly elsewhere. 

The latter generalisation is simple enough to be taught as a rule, but 

teachers need to be alert to cases where learners may over-generalise it. It 

never applies to <ch, tch>, and learners will find various groups of (real or 

apparent) exceptions (Some very rare): 

1) exceptions to ‘<c> followed by <e, i, y> is pronounced /s/’: 
(with /k/) arced, arcing, Celt, Celtic (but the Glasgow football team 
is /‘selttk/), chicer, chicest, sceptic (in British spelling) and words 
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beginning encephal- pronounced /enkefal-/ (also pronounced with 
/ensefal-/) 
(with /f/) cetacean, crustacea(n), Echinacea, liquorice, ocean, 
siliceous and words ending in <-aceous> pronounced /'‘etfas/, e.g. 
cretaceous, curvaceous, herbaceous, sebaceous and about 100 others, 
mostly scientific and all very rare; also officiate, speciality, specie(s), 
superficiality and sometimes ap/de-preciate, associate 
(with /t{/) only cellist, cello, concerto 
(with <cc> pronounced /k/) baccy, biccy, recce (short for reconnoitre), 
soccer, speccy, streptococci; 
2) exceptions to ‘<c> is pronounced /k/ everywhere else (except before 
<h>)’: 
(with /s/): apercu, facade; 
3) exceptions to ‘<g> followed by <e, i, y> is pronounced /d3/’: 
(a fair number with /g/ (see section 9.15), some of them high- 
frequency words): gear, get, give, tiger; giggle, girl, give 
(with other but rare pronunciations): see section 9.15; 
4) exceptions to ‘<g> is pronounced /g/ everywhere else’: 
(with /d3/): gaol, margarine, Reg, veg, and second <g> in mortgagor 
(with other but mostly rare pronunciations): see section 9.15 
(with <gg> pronounced /d3/): arpeggio, exaggerate, loggia, Reggie, 
Suggest, veggie, vegging. 
For practical purposes with young learners, the rule about the ‘soft’ and 
‘hard’ pronunciations of <c, g> can be considered 100% reliable, though 
they would probably need to be taught liquorice and ocean. 

Inspection of the headings of sections 9.6-43 will show that about half 
give the percentage of the basic pronunciation as 100%, and several others 
are close to that. In quite a few other cases attention to the context will 
combine lower percentages into something over 90% or in the upper 80%’s. 
The only ones in the lower 80%’s are <wh, x>. Overall the predictability of 
the pronunciations of main-system graphemes beginning with consonant 
letters is probably over 90%. The two major exceptions are medial and final 
<s> and word-final <se>, both of which have the two main pronunciations 
/s, z/, and for which few useful generalisations can be given. Even so, the 
pronunciations of consonant graphemes are much more predictable than 
those of vowel graphemes, as is obvious from chapter 10. 


10. The grapheme-phoneme 
correspondences of English, 
2: Graphemes beginning 
with vowel letters 


10.1 The general picture: the regular 
pronunciations of English graphemes 
beginning with vowel letters 


All the introductory remarks in sections 9.1-3 also apply here, with one 
addition. Like Carney, Gontijo et al. (2003) analysed a variety of RP in which 
/1/ does occur word-finally and is mainly spelt <y>, sometimes <i, ie>. 
As explained in sections 5.2, 5.4.3, 5.6.4 and 5.7.2, | disagree with this 
analysis, and instead, like Cruttenden (2014), posit that /1/ does not occur 
word-finally. This meant that, in the phoneme-grapheme direction, | took 
issue with Carney’s percentages for correspondences for /1, 1a/, and for 
/it/ dispensed with them completely. The mirror-image situation is that for 
<ie> (section 10.23) | was able to re-calculate Gontijo et al.’s percentages, 
but could not do so for <i, y> (sections 10.22, 10.40), which therefore have 
no percentages. 

This chapter contains 38 main entries, one for each of the 38 main- 
system graphemes beginning with a vowel letter. For those graphemes, the 
general picture can be summed up by saying that: 

just 4 have only one pronunciation: <air eer igh ore> (except for one 
tiny exception under <ore>) 

18 have only one main-system pronunciation, but varying numbers 
of minor correspondences which are exceptions to the main system: 
<a.e ai are au aw ay ea €€ €.e ere ie i.e if O.e Oi OU OY Ur> 
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3 have only two main-system pronunciations (and no minor ones), 

and those two pronunciations occur in circumstances which can be 

fairly closely defined: <ed ue u.e> 

6 have only two main-system pronunciations, which occur in 

circumstances which can be fairly closely defined (but varying numbers 

of minor correspondences): <ar ear ew 00 or ow> 

7 are moderately to highly variable: <ae eriouy>. It is uncomfortable 

that this category includes all six of the vowel letters as single-letter 

graphemes. 
And the lists just given still somewhat understate the case, since there are 
large numbers of Oddities - see Table 10.1. 

Table 10.1 is almost but not quite the mirror-image of Table 8.2 because: 

graphemes which begin with vowel letters but consonant phonemes 

(e.g. <u> in union) are included here; 

graphemes which begin with consonant letters but vowel phonemes 

(e.g. <ho> in honour are not included here but in Table 9.4. 
For completeness, it should also be noted that many minor vowel graphemes 
have highly predictable pronunciations, e.g. <augh>. In fact, of the 105 
graphemes beginning with vowel letters that are outside the main system, 
only 28 <ae ah al ao ais aye eau ei eigh eir eo eu eur ey ia is 0a Oar oe Oir ois 
oor ough our ua ui ure yr> have more than one pronunciation. As with the 
minor consonant graphemes, in any attempt (not made here) to estimate the 
overall regularity of the system this would need to be taken into account; 
and again, many minor graphemes are so rare that they would not affect the 
regularity calculation unless they occur in high-frequency words. 


TABLE 10.1: MAIN-SYSTEM GRAPHEMES BEGINNING WITH VOWEL LETTERS, 
BY MAIN-SYSTEM AND MINOR CORRESPONDENCES AND NUMBERS OF ODDITIES. 


Main system The rest 
Grapheme | Basic Other main- Exceptions to main Number of Oddities * 
phoneme | system system (minor which the grapheme 
correspondences | correspondences) ‘leads’ 
a /xe/ /el arp: 9/ /etat/ 22 
a.e /e1/ /ax/ 
ai /e1/ /ezetaat/ 4 
air /ea/ ] 
ar /ax/ /ea d1/ /a/ 
are /ea/ /ax/ 3 
au /o1/ /O av eI aU at/ 3 
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aw /:/ /31/ 1 

ay /er/ /e ix/ 4 

e /e/ /ixt9/ /o e1/ 35 

ea /ix/ /e e1/ 5 

ear /1a/ /ea/ /ax 31/ 

ed /d/ /t/ 

ee /it/ /er rT u:/ 

e.e /ix/ /er/ 

eer /ta/ 

er /31/ /19 9/ /ax ea er/ 2 

ere /ta/ /ea 31/ 

ew /ux/ /jux/ /a3u/ ] 

i /1/ /ix atj/ /xvo9/ 7 

ie /ix/ /etat/ 2 

i.e /at/ /ix/ 

igh /at/ 

ir /31/ /1a al ata/ 2 

(o) /o/ /A UL BU 9/ /1 U WA/ 17 

0.e /au/ /ur/ 

oi /o1/ /a wair/ 6 

oo /v/ /ux/ /A 3U/ 3 

or YOry /31/ /va/ 4 

ore />:/ /au/ 

ou /au/ /0 AU 2 9U UI W/ 18 

ow /au/ /au/ /0 3/ 1 

oy /o1/ /at/ ] 

u [A] /w v ur jur / /etajea/ 8 

ue /u:/ /jux/ 

u.e /ux/ /jux/ 

ur /31/ /3 Va jua/ 5 

y /j at/ /it 1/ /a/ 7 

Total 39 36 71 162 

38 75 233 
Grand total of correspondences: 308 


* including 2- and 3-phoneme pronunciations which are not part of the main system. 
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10.2 Order of description 


In this chapter | deal in conventional alphabetical order with 33 of the 38 
main-system graphemes of English which begin with vowel letters. The 
other main entries cover five of the six split digraphs. Three of these come 
immediately after the grapheme consisting of the same two letters not split, 
namely <e.e, i.e, u.e> after <ee, ie, ue> respectively. However, because 
<ae, oe> do not, in my analysis, belong to the main system and are dealt 
with under <a, o>, the sections dealing with <a.e, 0.e> follow the sections 
dealing with <a, o>. The only split digraph which does not have a main 
entry is <y.e>, which is not part of the main system; it is dealt with under 
<y>, immediately after <ye>. 

In most of the 38 main entries in this chapter I list the items in this order: 
1) The basic phoneme. In my opinion, each of these graphemes has a basic 

phoneme, the one that seems most natural as its pronunciation. 

2) Any other phonemes which are frequent pronunciations of the grapheme. 
These two categories constitute the main system for grapheme- 
phoneme correspondences for graphemes beginning with vowel 
letters. Correspondences in the main system are shown in 9-point 
type, the rest in smaller 7.5-point type. 

3) Exceptions to the main system, including any 2- or 3-phoneme 
correspondences for the main grapheme. The reason for listing 
exceptions to the main system separately from the Oddities is that this 
is the clearest way of showing where the main rules break down. 

4) Oddities, minor graphemes which begin with the letter(s) of the main 
grapheme and occur only in restricted sets of words. 

5) Any 2- and 3-phoneme graphemes which include, but do not have 
entirely the same spelling as, the main grapheme. Almost all of these 
are also Oddities, but a few belong to the main system and are included 
there. 

Most entries end with Notes; none have Tables, but <i> (section 10.22) has 

a flowchart. 

The only exceptions to this ordering are the four graphemes which have 
only one pronunciation each: <air, eer, igh, ore>. Under each of these there 
is usually just one heading, ‘Only phoneme’, and it is automatically part of 
the main system without having to be so labelled; however, the entries for 
<igh, ore> have Notes. 
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10.3 <a> 


N.B. <a.e, ai, air, ar, are, au, aw, ay> have separate entries. 


THE MAIN SYSTEM 


For all these categories see Notes. 


Basic /x/ 50% e.g. cat, pasty /‘pesti:/ (‘pie’) 
phoneme 

Other /a/ 16% e.g. about, dynamo, opera. Regular 
phonemes when unstressed in all positions, 


including a, an, but see Exceptions 

and Notes in this section and Notes in 
the next section for unstressed <a> 
pronounced /e, I, 0, 1, aI, ar/. Also see 
Notes in next section for words with 
final <-ate> pronounced /at/ 


/ax/ 9% e.g. blasé, father 

/er/ 8% e.g. agent, bacon, pasty /'petsti:/ 
(‘whey-faced’) 

/o/ 8% e.g. squash, was, what. Regular after 
<qu, w> 

/>:/ <1% e.g. always, bald, tall, water. Regular in 


some circumstances 


THE REST 
pronounced 
Exceptions to <a> /e/ 1% only in: 
main system - any, ate pronounced /et/ (also pronounced 


/e1t/), many, Thames, first <a> in secretaria-l/t, 
second <a> in asphalt pronounced /'z/Jfelt/ 
(also pronounced /'‘zsfelt/) 

- a few words ending <-ary> with the stress 

two vowels before the <a>, e.g. necessary, 
secretary pronounced /'nesaseri:, 'sekrateri:/ 
(also pronounced /'nesasri:, 'sekratri:/ with no 
vowel phoneme corresponding to the <a> - for 
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Oddities 


<a> 


<a> 


<aa> 


<aar> 
<ach> 


<ae> 


<ae> 


/1/ 


/at/ 


/e/ 


the elided vowels in this and the next three 
paragraphs, see section 6.10) - a few adverbs 
ending <-arily>, e.g. militarily, necessarily, 
primarily, voluntarily pronounced /mulr'tertli:, 
nesoa'sertliz:, prar'mertli:, volan'tertli:/ with 

the <a> stressed (also pronounced /'milrtrali:, 

‘nesasrali:, ‘prarmrali:, 'volantrali:/ with stress 
shifted two vowels forward, again no vowel 
phoneme corresponding to the <a>, and the 
vowel before /li:/changed from /1/ to /2/) 

- temporary pronounced /'tempereri:/ (also 
pronounced /'tempreri:/ with no vowel 
phoneme corresponding to the <o>) 

- temporarily pronounced /temps'rertli:/ (also 
pronounced either /tempa'reartli:/ with <ar> 
pronounced /ea/ and the <r> also a grapheme 
in its own right pronounced /r/ - for dual- 
functioning see section 7.1 - or reduced to 
three syllables as /'temprali:/ with no vowel 
phonemes corresponding to the <o> or the 
<a> and the two /r/ phonemes reduced to one) 


1% only unstressed but in about 250 words 
ending in <-age>, which is mainly pronounced 
/1d3/, e.g. village, plus furnace, menace, 
necklace, octave, orange, signature, surface, 
spinach pronounced /'spinidg/ and second 
<a> in character, palace. For words where the 
ending <-age> is pronounced /e1, a:3/ see 
<a.e>, section 10.4 


only in majolica pronounced /mar'joltka/ (also 
pronounced /ma'dgplika/), naif, naive, papaya 


only in baa, Baal, Graal, kraal, laager, naan, 
salaam 


only in aardvark, aardwolf, bazaar, haar 
only in yacht 


is the usual correspondence, e.g. aegis, 

aeon, aesthetic, algae, alumnae, antennae, 
archaeology, Caesar(ian), caesura, mediaeval, 
pupae, vertebrae. For exceptions see next 3 
rows 


only in haemorrhage, haemorrhoid 


<ae> 
<ae> 


<aer> 


<ah> 


<ah> 


<ah> 


<al> 


<al> 


<al> 
<alf> 
<anc> 
<ao> 
<ao> 
<aoh> 
<aow> 
<as> 
<at> 


2-phoneme (none) 
graphemes 


NOTES 


/et/ 
/at/ 
/ea/ 


/a/ 


/ax/ 


/e1/ 
/ax/ 


/3:/ 


/e/ 

/e1/ 
/a/ 

/e1/ 
/ea/ 
/au/ 
/av/ 
/ax/ 
/ax/ 
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only in brae, Gaelic, maelstrom, reggae, sundae 
only in maestro, minutiae 


only in faerie and compounds of air spelt 
<aer>, e.g. aerial. The <r> is also a grapheme 
in its own right pronounced /r/. For dual- 
functioning see section 7.1 


only word-final and only in ayah, cheetah, 
fellah, haggadah, Hannah, hallelujah, loofah, 
messiah, moolah, mullah, mynah, pariah, 
purdah, (maha) rajah, Sarah, savannah, 
verandah, wallah and some other very rare 
words 


only word-final and only in ah, bah, hookah, 
hoorah, kabbalah, Shah, whydah 


only in dahlia 


only in calf, half, calve(s), halve(s), salve(s) (also 
pronounced /selv(z)/); almond, almoner, alms, 
balm, calm, embalm, malmsey, napalm, palm, 
psalm, qualm 


only in balk, calk, chalk, stalk, talk, walk. See 
also <aul> under <au>, section 10.9 


only in salmon 

only in halfpence, halfpenny 
only in blancmange /bla'monds / 
only in gaol 

only in aorist 

only in pharaoh 

only in miaow 

only in fracas 


only in eclat, entrechat, nougat 


For instances of <a> as an elided vowel see section 6.10. 
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<a> is the least predictable of the single-letter vowel graphemes. Its 
default pronunciation as a single-letter grapheme is /z#/, which occurs 
in many uncategorisable circumstances, but here are a few categories for 
guidance: 
regular before geminate and doubled consonant spellings, e.g. 
flabbergasted (first <a>), back, cackle, add, addled, badge, badger, 
chaffinch, gaff, gaggle, ammo, annual, banns, apple, Lapp, arrow, 
classical, lass, match, satchel, battle, matt, jazz, plus <cc> when 
the two letters are pronounced separately, e.g. accent. Extension: 
gaffe. Exceptions (in RP): chaff, distaff, staff and most words with 
final <-ass>, but this is a small set, namely brass, class, glass, grass, 
pass, with /ax/ (and there are four sub-exceptions with regular /x/: 
ass, bass (‘fish’), lass, mass and one, bizarrely, with /e1/: bass /bets/ 
‘(player of) large stringed instrument’/‘(singer with) low-pitched 
voice’), most words with final <-all> (see below), e.g. all, with /3:/ 
(sub-exceptions: mall, shall with regular /z/), and several words 
with preceding <w>, namely swaddle, swallow, twaddle, waddle, 
waddy, waffle, wallet, wallop, wallow, wally, wannabe, warrant, 
warren, warrigal, warrior, wassail, wattle, plus quaff, quagga (also 
pronounced with /z/), quarrel, quarry, scallop, squabble, all with /v0/ 
regular in other words where <a> is the only or last vowel letter 
and is followed by at least one consonant letter (i.e. those without 
a geminate or doubled consonant spelling), e.g. asphalt pronounced 
/‘esfalt/ (also pronounced /'zffelt/), bad, balderdash, bombast, cat, 
detract, gymnast, impact, lambast, pant. Exceptions: see those with 
/ax, 9:/ below 
in some words with <a> as the penultimate vowel letter followed by 
two or more consonant letters (or <x> and word-final <e>), other 
than those with <-a.e> pronounced /e1, a:/ (see the next section) and 
the long list of those with /a:/ (see below), e.g. axe, collapse, flange 
as the vowel letter before a consonant letter or cluster and the endings 
<-ic(al)>, e.g. asthmatic, classic(al), drastic, dynastic, elastic, heraldic, 
mastic, peristaltic, plastic, spastic (only exception: aphasic, with /e1/). 
The stress always falls on the relevant <a> except in Arabic, lunatic, 
which are different from almost all other words in <-aCic> both in 
having the stress on the vowel two before the <-ic> and in having 
the <a> before the <ic> ending pronounced /a/. There are also two 
relevant words with variant pronunciations: (fly) agaric /a'gerik/ 
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(regular) or /‘egartk/ (exception), chivalric /ft'velrik/ (regular) or 
/‘Sivalrtk/ (exception); on chivalric the Oxford English Dictionary says 
‘The first pronunciation is that sanctioned by the poets’ 
in some other words when <a> is at least the penultimate vowel letter 
(or, if the word-final letter is <e>, at least the antepenultimate vowel 
letter) and is followed by more than one consonant letter, e.g. alto, 
altitude, altruism, bastion, chastise, formaldehyde, gasket, gather, 
mastiff, mastoid, pastel, pastern, pastille, pasty (‘pie’), satchel, and 
first <a> in advantageous, asthmatic, asphalt whether pronounced 
/‘esfalt/ or /‘effelt/, bastinado, cantata, cascara, fantasia, maltreat, 
mascara 
in a further ragbag of non-final occurrences, e.g. acid, chariot, 
companion, habit, lavish, manioc, parish, patio, piano, placid, ration, 
vanish; (first <a>) avalanche, avocado, balaclava, basalt, caviar, 
marijuana, national, panorama, (inrational, valiant; (first and second 
<a>’s) alpaca; (second <a>) battalion. 

The task then is to define the circumstances in which <a> has other 

pronunciations. These can be summarised as follows. 

For <a> pronounced /a:/ in RP (where it is much more frequent than in 
most other accents of English) Carney (1994: 291-4) gives a set of five rules, 
all of which have special conditions and exceptions. Instead, here is a set of 
categories with lists of examples (but with exceptions only for one category; 
for others they would be too numerous to list): 

word-finally: only in bra, ma, pa, schwa, spa; grandma, grandpa, 
hoopla, mama, papa 

often when <a> is the penultimate vowel letter and there is at least 
one earlier vowel letter in the word separated from it by at least one 
consonant letter and the relevant <a> is followed by a single consonant 
letter followed by word-final <a, i, o>: armada, avocado, balaclava, 
banana, bastinado, cantata, cascara, cassava, cicada, cinerama, 
cyclorama, desiderata, desperado, farrago, Gestapo, gymkhana, 
iguana, incommunicado, karate, legato, liana, literati, marijuana, 
mascara, meccano, pajama, palaver, panorama, pastrami, pyjama, 
safari, salami, schemata, sonata, soprano, staccato, stigmata, sultana, 
svengali, tiara, toccata, tomato, tsunami, virago. Extensions (1) with 
‘pronounced’ final <e>: biennale, blase, finale, karate, macrame 
(contrast sesame, with /9/); (2) a few words with no earlier vowel letter: 
drama, gala, guano, guava (in these two words <u> is a consonant 
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letter), khaki, lager, lama, lava, llama, nazi, plaza, pro rata, saga, 
strata. Exceptions: alpaca, piano with /z/; dado, data, halo, lumbago, 
potato, sago, tornado, volcano with /e1/ 
often before two consonant letters in words where <a> is the only 
vowel letter or the word-final letter is <e>: chaff, staff, aft, craft, 
graft, haft, raft, shaft; hajj; chance, dance, glance, lance, prance, 
trance; blanch, branch, ranch, can’t, chant, grant, plant, shan’t, slant; 
graph, ask, bask, cask, flask, mask, task; clasp, gasp, grasp, hasp, 
rasp; basque, casque, masque; brass, class, glass, grass, pass, blast, 
cast, caste, fast, last, mast, vast; bath, lath (also pronounced with 
/z#/), path, plus tranche with three consonant letters; probably none 
of these words would be pronounced with /a:/ in any accent other 
than RP 
in all the unsuffixed compounds of graph: auto/cardio/choreo/di/ 
encephalo/epi/mimeo/para/photo/quad/tele/tri-graph, where -graph 
is always unstressed; again, probably none of these words would be 
pronounced with /a:/ in any accent other than RP 
often before two or three consonant letters in words not covered in 
previous categories: macabre; debacle, cadre, padre; distaff, giraffe; 
abaft, after, rafter, example, sample; advance, chancel, chancery, 
enhance; avalanche, revanchis-m/t, command, commando, countermand, 
demand, remand, slander, rascal (first <a>); answer; basket, bergomask, 
casket; aghast, caster, castle, castor, contrast (verb and noun), disaster, 
fasten, flabbergasted (second <a>), ghastly, master, nasty, pasta (also 
pronounced with /z/), pasteurised, pastime, pastor, pasture, plaster, 
father, lather (also pronounced with /2x/), rather, latte. Unstressed only 
in distaff, avalanche, contrast (/‘kontra:st/, noun), flabbergasted. \n 
this category, the first 4 words would probably be pronounced with /a:/ 
in all accents, the rest only in RP 
otherwise only in amen pronounced /a:'men/ (also pronounced 
/et'men/), banal (second <a>), corral, praline, raj. 
(For <a.e> pronounced /a:/ see the next section). 
<a> is pronounced /p/ mainly after <qu, w> and only in the following 
groups of words: 

after <qu>, e.g. quad and all its derivatives, quadrille, quaff, 
quag(mire) (also pronounced /'kweg(-)/), quagga (also pronounced 
/‘kwega/), qualify and all its derivatives and associates, (e)quality, 
quandary, quant and all its derivatives, quarantine, quarrel, quarry, 
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quash, quatrain, squab(ble), squad and derivatives, squal-id/or, 
squander, squash, squat. Unstressed only in quadrille. Only exception, 
strictly speaking: squall, with />:/. However, there are also fairly large 
sets of apparent exceptions which contain the <ar> digraph or the 
<are> trigraph - see sections 10.7-8 

after <w>, e.g. swab, swaddle, swallow, swamp, swan, swap, 
swash(buckling), swastika, swat, swatch, twaddle, wad, waddle, 
waddy, wadi, waffle, waft pronounced /woft/ (also pronounced 
/wo:ft/), wallet, wallop, wallow, wally, walrus pronounced /'wolras/ 
(also pronounced /'wo:lras/), wampum, wan, wand, wander, wannabe, 
want, wanton, warrant, warren, warrigal, warrior, was, wash, wasp, 
wassail, wast, watch, watt, wattle; in all these words the <a> is 
stressed. Only exceptions, strictly speaking: walk (where <al> is in 
any case a digraph), wall, water, all with /3:/ - but see again the last 
sentence of the previous paragraph 

otherwise, only in ambience, bandeau, bouffant, chanterelle, 
confidant(e), debutante, fiance(e), flambe, flambeau, insouciance, 
mange-tout, moustache (now mostly pronounced with /a:/ in RP), 
nuance, scallop, séance, wrath pronounced /rv@/ (also pronounced 
/r3:8/), what, first <a> in jalap, stalwart, second <a> in blancmange, 
diamante; unstressed only in bouffant, confidante), debutante, 
insouciance, nuance, seance. 

For a teaching rule based on the words with <qu, w, wh> followed by <a> 

see section 11.5. 
<a> is pronounced /9):/: 

1) in <al-> word-initially when it is a prefix reduced from all: albeit, 
almighty, almost, already, although, altogether, always and even in the 
mistaken spelling “alright. All unstressed except almost, always 

2) before <lId, It>: alder, alderman, bald, balderdash, baldric, scald, 
thraldom (also. spelt thralldom); altar, alter, alternate (both 
pronunciations and meanings - see the next section), although (again), 
altogether (again), balti, basalt, cobalt, exalt, falter, halt, halter, malt, 
paltry, salt; unstressed only in alternate pronounced /):I't3:nat/ (‘every 
other’), although, altogether, basalt, cobalt. Exceptions: formaldehyde, 
heraldic, alto, altitude, altruism, maltreat, peristaltic with /#/; asphalt 
whether pronounced /'zsfalt/ or /'‘effelt/; contralto with /a:/; emerald, 
herald, ribald, loyalty, penalty, royalty, subaltern with /2/ 
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3) 


4) 


before word-final <-ll>: all, ball, call, fall, gall, hall, pall, small, squall, 
(in)stall, tall, thrall, wall (but mall, shall have /z/; hallo, though the 
<a> is not word-final, is the only other example of <a> before <Il> 
pronounced /z/ (sometimes: the pronunciation of this word varies 
between /hz'lau, he'lau/ and /ha'lau/); and in installation the shifting 
of the stress because of the suffix reduces the <a> to /a/, which is 
also the pronunciation of unstressed <a> before <Il> in, e.g., balloon). 
The <-all> pronounced /):I/ group is one of only five cases where the 
pronunciation of a phonogram/rime is more predictable as a unit than 
from the correspondences of the separate graphemes, and there are 
enough instances to make the rule worth teaching; see section A.7 in 
Appendix A 

otherwise only in appal (second <a>), balsam (first <a>), falcon, enthral, 
instalment, palfrey, water, also waft, walrus, wrath pronounced 
/wo:ft, 'wo:lras, ro18/ (also pronounced /woft, 'wolras, ro8/); in all these 
words the relevant <a> is stressed. 


See also <al> pronounced /3:/ in the Oddities above, and <aul> under 


<au>, section 10.9. 


<a> (as distinct from <a.e> - see next section) is pronounced /e1/ in 


just one word where it is the only vowel letter, namely bass /bets/ ‘(player 


of) large stringed instrument’/‘(singer with) low-pitched voice’), and in 


four categories of longer words where a rule can be stated, plus a ragbag 


category where any rules would be too complex to be worth stating. <a> is 


pronounced /e1/ in: 


1) 


2) 


large numbers of words where <-e> has been deleted before a suffix 
beginning with a vowel letter, e.g. creation, navigating - see sections 
6.3 and especially 6.4. Exception: orator /‘orata/, where stress has 
shifted from orate /):'re1t/ 
large numbers of words where <a> is followed by a single consonant 
letter other than <r> and then by: 
any of <ea, eou, ia, io, iou, iu> followed word-finally by a single 
consonant letter or none, e.g. azalea; advantageous, cretaceous, 
herbaceous, instantaneous, subcutaneous, alias, facial, fantasia, 
regalia, labial, mania(c), palatial, racial; contagion, equation, evasion, 
invasion, occasion and hundreds more ending in <-asion>, nation 
and thousands more ending in <-ation>, radio, ratio; pugnacious; 
gymnasium, stadium, uranium 
(<-ien-ce/t>, e.g. patience; gradient, patient, salient. Extension with 
two intervening consonant letters: ancient. 
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Exceptions: battalion, caviar, chariot, companion, manioc, patio, ration, 
valiant, plus national, which is a derivative of a word which obeys the 
rule, and (iN rational, which are derivatives of a word which does not; all 
these words have /2/. 
In all these words (including the exceptions) the stress falls on the 
relevant <a> 
3) a small group of words (and derivatives) where <a> is followed by 
a consonant letter and then <le, re>, e.g. (dis/en-)able, cable, fable, 
gable, sable, (un)stable, table; cradle, ladle; maple, staple; sabre; acre, 
nacre (but not cadre, padre, which have /a:/). Again, in all these words 
(including the exceptions) the stress falls on the <a> 
4) almost all words in the ending <-ator>. Only exceptions: conservator, 
conspirator, orator, predator, senator, which all have <a> pronounced 
/a/ and stress on the vowel before that. All other words ending in 
<-ator> have the stress on the <a> if they have only one earlier vowel 
letter, e.g. creator, curdtor, dictdtor, spectator, otherwise on the vowel 
two before the <a>, e.g. administrator, agitator, aviator, calculator, 
commentator, insulator 
5) in the following uncategorisable words: aorta, apron, bacon, basal, 
bathos, blatant, blazon, cadence, canine, capon, chao-s/tic, fatal, favour, 
flavour, fragran-ce/t, kaolin, labour, lady, latent, mason, matron, nadir, 
nasal, natron, naval, pagan, papal, pastry, patent (‘obvious’; the word 
of the same spelling meaning ‘registered design’ can be pronounced 
with /e1/ or /2/), pathos, patron, planar, saline, savour, scalar, status, 
tapir, vacant, vagrant, vapour, wastrel, (first <a>) papacy, vacancy, 
vagary, vagrancy, wastrel, also creative, dative, native - in all other 
adjectives ending <-ative> the <a> is unstressed and pronounced /a/. 
The <a> is stressed in all these words except aorta, chaotic. 
<a> is pronounced /a/ only when unstressed. Even though this is the 
predominant pronunciation in unstressed syllables (which in any case 
cannot be deduced from the written forms of words - see section A.10 in 
Appendix A), virtually the only rule that can be given for where /a/ occurs is 
that given in the second paragraph of these examples: 
word-initially: abaft, about, advance, aghast, ago, appal, arrange, 
askance, askew, awry, azalea 
medially in the endings <-able (in words where there is at least 
one earlier vowel letter), -al, -ance, -ancy, -ant, -ative, -iary>, 
e.g. biddable, liable, syllable, valuable; actual, facial, fatal, labial, 
nasal, national, naval, normal, palatial, papal, (inrational, visual: 
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blatant, fragran-ce/t, vacan-cy/t, vagran-cy/t, valiant; causative, 
laxative, palliative (only exceptions: creative, dative, native - see just 
above); apiary, auxiliary, aviary, breviary, domiciliary, intermediary, 
pecuniary, stipendiary, subsidiary, topiary 

medially in some words ending in <-ate> - see next section 

medially also in thousands of unclassifiable words, e.g. archipelago, 
balloon, breakfast, buffalo, conservator, conspirator, dynamo, 
emerald, herald, loyalty, lunatic, orator, papa, penalty, predator, 
ribald, royalty, senator, subaltern, first <a> in battalion, blancmange, 
encephalogram, instantaneous, palatial; second <a> in alias, Arabic, 
avalanche, balaclava, ballast, balsam, damask, pagan, papacy, 
paragraph, vagary 

word-finally: aorta, armada, aroma, azalea, balaclava, banana, 
bravura, cantata, cascara, cassava, cicada, cinerama, cyclorama, 
data, desiderata, drama, fantasia, gala, guava, gymkhana, hosta, 
iguana, lama, lava, liana, llama, marijuana, mania, mascara, opera, 
pajama, panorama, pasta, plaza, pro rata, pyjama, regalia, saga, 
schemata, sonata, stigmata, strata, sultana, tiara, toccata. 


10.4 <a.e> 


Occurs only where the <e> is word-final. 
See Notes for all categories and for how this split digraph is defined, 
and see section 11.4 for a teaching rule relevant to all split digraphs except 


<y.e>. 

THE MAIN SYSTEM 

Basic /et/ 68% regular in words where <a> is the only 

phoneme vowel letter other than the word-final <e>), 
e.g. make, take; in longer words, only in 
compounds plus assuage, engage, rampage 

Other /ax/ 32% only in about 40 mostly French loanwords, 

phoneme e.g. charade, mirage 

THE REST 

Exceptions to main system strictly speaking, none, but see Notes 

Oddities (none) 


2-phoneme graphemes (none) 
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NOTES 


The split digraph <a.e> is defined as covering words where word-final <e> 
is separated from the <a> by one consonant letter other than <r, w, x, y> 
and the <a> is not preceded by a vowel letter and the digraph is pronounced 
either /er/ or /a:/. The definition covers both words where the intervening 
consonant letter is an independent grapheme and words where the <e> is 
also part of a consonant digraph <ce, ge, ve> - see sections 3.7.4, 3.7.6-7 
and 3.8.4, and section 7.1 for dual-functioning. 

The familiar /et/ pronunciation occurs in many hundreds of words and 
does not need further illustration. The /a:/ pronunciation occurs only in 
about 40 (mostly French) loanwords; those which fit the main definition 
just given (for extensions see below) are aubade, ballade, charade, chorale, 
facade, grave (/gra:v/, ‘French accent’), locale, morale, pavane, promenade 
(noun, ‘seafront path’; the verb with the same spelling, ‘walk at leisure’, is 
pronounced with /e1/), rationale, rodomontade, roulade, soutane, strafe, 
Suave (where the <u> is a consonant letter), timbale, vase, plus a set of 
words ending in <-age> pronounced /a:3/, namely badinage, barrage, 
camouflage, collage, corsage, decalage, décolletage, dressage, entourage, 
espionage, fuselage, garage pronounced /'gera:3/, massage, menage, 
mirage, montage, triage, sabotage. 

The definition needs the following extensions: 

eight words in which two consonant letters forming a consonant 
digraph separate <a.e>: ache, bathe, champagne, lathe, unscathed 
(the free form “scathe meaning ‘to harm’ does not occur, but underlies 
both unscathed and scathing), swathe with /e1/, gouache, moustache 
with /a:/ (contrast attache, where the <a> is pronounced /x/ and 
the <e> is a separate grapheme pronounced /e1/ and is increasingly 
written within English text as <é>; also contrast cache, panache, 
where the <a> is again pronounced /z/, and <che> is a trigraph 
pronounced /{f/) 

five words in which <gu, qu> forming consonant digraphs separate 
<a.e>: opaque, plague, vague with /e1/, claque, plaque with /a:/ 
eight words ending in <-ange> pronounced /etnd3/ (i.e. <n, g> do 
not form a digraph): arrange, (ex)change, grange, mange, range, (e) 
strange (contrast the only other three words ending <-ange>, all with 
<e> as part of digraph <ge> pronounced /d3/ and <a> as a separate 
grapheme with varying pronunciations: blancmange with /v/, flange 
with /z/, orange with /1/). Note that in the words with <-ange> 
pronounced /eindz/ the <e> is not only part of digraph <a.e> but also 
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forms part of digraph <ge> pronounced /dg/ - for dual-functioning 
see section 7.1 
seven words ending in <-aste> pronounced /eist/: baste, chaste, 
haste, lambaste, paste, taste, waste, and one with<-aste> pronounced 
/axst/: caste. 
For an attempt to justify this definition despite its circularity and fuzzy 
edges see Appendix A, section A.6. 

In all cases where the <e> is not the last letter in the stem word, <a, e> 
with an intervening letter(s) are separate graphemes. This is also true of all 
words with <a> and word-final <e> separated by more than one consonant 
letter or by a consonant digraph, except the 29 words listed above. 

Where <a> and word-final <e> are separated by just one consonant 
letter and the <a> is preceded by a consonant letter, the position is more 
complicated. Many such words look as if the <a, e> should constitute a 
split digraph - but they do not, according to my definition, because the 
vowel phoneme preceding the stem/final consonant phoneme is neither 
/et/ nor /a:/. However, guidance is still needed on when words of this sort 
do not have either of the split digraph pronunciations, especially since there 
are pairs of words with identical spelling of which one does have <a.e> 
pronounced /e1/ and the other does not. 

There are two groups of words in which unstressed <a> before stem- 
final <te> is not part of a digraph <a.e> pronounced /e1/ and is instead 
pronounced /a/: 

at least 60 words (all nouns/adjectives) where this is the only pronun- 
ciation, e.g. accurate, adequate, agate, appellate, celibate, chocolate, 
climate, collegiate, conglomerate, (in)considerate, consulate, consum- 
mate, delicate, desperate, (in)determinate, directorate, disconsolate, 
doctorate, electorate, episcopate, extortionate, fortunate, illegiti- 
mate, immaculate, immediate, inanimate, incarnate, in(sub)ordinate, 
inspectorate, intricate, inviolate, (bacca)laureate, legate, (illiterate, 
magistrate, novitiate, obdurate, palate, particulate, (com/dis)passion- 
ate, private, profligate, proletariate, (dis)proportionate, protectorate, 
proximate, roseate, senate, surrogate, (in)temperate, triumvirate, ulti- 
mate, (in) vertebrate 

a further set of about 30 nouns/adjectives with final <-ate> 
pronounced /at/ where the verbs with the same spelling have <-ate> 
pronounced /eit/, e.g. advocate, affiliate, aggregate, alternate (here 
with also a difference in stress and vowel pattern: noun/adjective 


The grapheme-phoneme correspondences, 2 359 


pronounced /o:I't3:rnat/, verb pronounced /'d:ltaneit/), animate, 
appropriate, approximate, articulate, associate, certificate, coordinate, 
curate (here with also a difference in meaning and stress: noun 
(‘junior cleric’) pronounced /'kjuarat/, verb (‘mount an exhibition’) 
pronounced /kjua'rert/), degenerate, delegate, deliberate (here with 
also a difference in syllable structure: adjective /dr1'ltbrat/ with three 
syllables and an elided vowel - see section 6.10; verb /dr1'l1barert/ 
with four syllables), designate, desolate, duplicate, elaborate, 
estimate, expatriate, graduate, initiate, intimate, legitimate, moderate, 
pontificate (here with unrelated (?) meanings: noun (‘pope’s reign’) 
pronounced /pon'tifikat/, verb (‘speak pompously’) pronounced 
/pon'tifikeit/), predicate, separate (here too with a difference in 
syllable structure: adjective /'seprat/ with two syllables and an 
elided vowel - see section 6.10; verb /'separeit/ with three syllables), 
subordinate, syndicate, triplicate 
There is no rule by which the words with <-ate> pronounced /at/ can be 
distinguished from those with <-ate> pronounced /eit/ - they just have to 
be learnt. Where <-ate> is pronounced /at/ the <e> is phonographically 
redundant. 
There are hundreds of English words ending <-age>. In words where 
<a> is the only vowel letter) and their derivatives, e.g. enrage, interstage, 
plus assuage, engage, rampage, <a.e> is a digraph with the regular 
pronunciation /er/. But in longer stem words (except the three just listed) 
<-age> is pronounced either /a:3/ or /1d3/: 
for the 18 words with /a:3/ (therefore containing the minority digraph 
pronunciation), see the the list above 
by far the most frequent pronunciation of stem-final <-age> in words 
with at least one earlier vowel letter before the <a>, e.g. garage 
pronounced /'gerid3/, image, mortgage, village and about 250 other 
words) is therefore /1d3/. Here <a, e> do not form a digraph; <a>isa 
single-letter grapheme pronounced (peculiarly) /1/ - see the previous 
section - and the <e> forms a digraph with the <g>. Again, there is 
no rule by which the other two groups of longer words ending <-age> 
(stressed pronounced /e1dg/, stressed or unstressed pronounced /a:3/) 
can be distinguished from this group - they just have to be learnt. 

An oddity here is the word garage with its two pronunciations (in RP), the 

more French-like /'gwra:3/ and anglicised /‘gzridz/ (see section A.6 in 

Appendix A). 
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Then there are just 14 words with <a> preceded by aconsonant letter and 
separated from word-final <e> by one consonant letter in which the <e> is 
a separate grapheme pronounced /i:/ or /e1/ or sometimes either, namely 
six French loanwords increasingly spelt in English text with French <é>: 
blase, cafe, canape, glace, macrame, pate (‘paste’), plus agape (/‘egapel/, 
‘love feast’ (from Greek), as opposed to /a’ge1p/, ‘open-mouthed’), biennale, 
curare, finale, kamikaze, karate, sesame, tamale. 

The only other exceptions to the rule that <-a.e> (with one intervening 
consonant letter) is a digraph are: ate, which is often pronounced /et/ rather 
than /ert/, have whether pronounced /hev/ (stressed) or /av/ (unstressed), 
and furnace, menace, necklace, palace, pinnace, preface, solace, surface, 
terrace, carafe, gunwale, carcase, purchase, octave with <-ace, -afe, -ale, 
-ase, -ave> pronounced variously /1s, ef, al, as, Iv/. 


10.5 <ai> 
N.B. <air> has a separate entry. For the dual percentages see Notes. 


THE MAIN SYSTEM 


Basic phoneme _ /et/ 43%/79% e.g. paint 


THE REST 
pronounced 
Exceptions to main <ai> /e/ 46%/<1% only in bouillabaisse, said, saith 
system and (usually, nowadays) again(st). See 
Notes 
<ai> /1/ 8%/14% only in bargain, captain, 
chamberlain, chaplain, fountain, 
mountain, porcelain 
<ai> /3/ 4%/7% only in certain, chieftain, coxswain, 
curtain, mainsail (second <ai>), topsail, 
villain 
<ai> /e/ <1% only in Laing, plaid, plait 
<ai> /at/ <1% only in ailuro-phile/phobe, assegai, 


balalaika, banzai, bonsai, caravanserai, 
Kaiser, naiad, samurai, shanghai 


Oddities <aigh> /er/ 
<ais> /at/ 
<ais> /e1/ 
<ait> /e1/ 

2-phoneme (none) 

graphemes 

NOTES 
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only in straight 
only in aisle 
only in palais 


only in distrait, parfait, and trait 
pronounced /trer/ (also pronounced 
/trert/) 


Where two percentages are shown above, the first is that given by Gontijo 
et al. Among these, the percentage for <ai> pronounced /e/ has been 
completely distorted by the high frequency of again, said. | have therefore not 
promoted this correspondence to the main system, but | have re-calculated 
all the percentages for this grapheme omitting those two words. Where they 
differ from the originals, the revised percentages are shown second. 

<a, i> are separate graphemes (with automatic intervening /j/-glide) in 
algebraic, apotropaic, archaic, dais, formulaic, laity, mosaic, prosaic, etc. 


10.6 <air> 


THE MAIN SYSTEM 


Only /ea/ 100% 
phoneme 


THE REST 


pronounced 


Oddity <aire> /ea/ 


e.g. pair. Always stressed except in corsair 
(usually), millionairess (always), mohair 
(always) 


only word-final and only in a few polysyllabic 
words of mainly French origin, namely affaire, 
commissionaire, concessionaire, doctrinaire, 
laissez-faire, legionnaire, millionaire, questionnaire, 
secretaire, solitaire. /r/-linking occurs in 
millionairess - see section 3.6 
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10.7 <ar> 


N.B. <are> has a separate entry. 


THE MAIN SYSTEM 


Basic /ax/ 78% 
phoneme 
Other /d:/ 8% 
phonemes 
/ea/ <1% 
THE REST 
pronounced 
Exception to <ar> /a/ 


main system 


regular in words where <a> is the 

only vowel letter (only exceptions: 
monosyllables in next paragraph); 

in longer words, regular before a 
consonant letter when stressed, 

e.g. farther (exceptions: see both 
paragraphs of Notes); also word-finally 
when stressed, e.g. ajar, cigar, guitar, 
hussar 


only in athwart, award, dwar-f/ves, 
quark pronounced /kwo:k/ (also 
pronounced /kwark/), quart(an/er/ 
et/ic/ ile/z), reward, sward, swarf, 
swarm, swart, swarthy, thwart, towards, 
untoward, war, warble, ward, warden, 
warfarin, warlock, warm, warn, warp, 
wart, whar-f/ves 


initially, only in area, Aries; never 
word-final. Regular medially before a 
vowel letter other than word-final <e>, 
especially where <-e> has been deleted 
before a suffix beginning with a vowel 
letter, e.g. caring, but there are also 
independent examples, e.g. parent - see 
Note. Before a consonant letter, only in 
Scarce, scarcity 


14% does not occur initially; medially, regular 
in the suffixes /-wad(z)/ spelt <-ward(s)>, e.g. 
afterwards, backwards), downwards), 
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forwards), forward, inward, leeward, onward, 
outward, windward (exceptions: towards, 
untoward - see last but one paragraph); word- 
finally, regular when unstressed, e.g. altar, 
peculiar, sugar (exceptions: antimacassar, 
ashlar, attar, avatar, cougar, dinar, lazar, 
samovar, sitar). Otherwise only medial and 
only in an unpredictable ragbag of words, 

e.g. anarchy, awkward, bastard, billiards, 
bombardier, bulwark, coward, custard, dotard, 
gabardine, halyard, innards, lanyard, monarch, 
mustard, scabbard, stalwart, steward, 
vineyard, wizard 


Oddities (none) 
2-phoneme (none) 
graphemes 

NOTES 


<ar> is always a digraph in the following circumstances (some of which 

overlap): 
word-finally, e.g. car, cigar, war; 
in words where <a> is the only vowel letter other than word-final 
<e>), e.g. car, cart, scarce; 
when the next letter is a consonant, e.g. cart, carton, scarce, scarcity; 
when the <e> of word-final <-are> has been deleted before a suffix 
beginning with a vowel letter, e.g. caring (though in these cases the 
<r> also functions as a grapheme in its own right - see section 7.1). 
But where the next letter is a vowel that is not part of a suffix, <ar> 
appears to be a digraph only in adversarial, Aquarius, area, Aries, 
barium, commissariat, garish, gregarious, hilarious, malaria(|), 
multifarious, nefarious, parent, precarious, proletariat, Sagittarius, 
variegated, various, vary, and a fairly large set of nouns/adjectives in 
<-arian>, e.g. agrarian, barbarian (2" <ar>), centenarian and other 
age terms, egalitarian, grammarian, librarian, proletarian, utilitarian, 
vegetarian. In all these cases the <r> is both part of the digraph <ar> 
pronounced /ea/ and a grapheme in its own right pronounced /r/. 
For dual-functioning see section 7.1. Otherwise <a, r> are separate 
graphemes, e.g. Arab, lariat, larynx, pharynx, scarab, scarify, variety. 
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A few words have alternative pronunciations where one requires 
analysing <ar> as a digraph but the other does not, e.g. secretariat, where 
the first <a> and second <r> can be pronounced either /ear/ (with <ar> as 
a digraph and the <r> also a grapheme in its own right pronounced /r/) or 
/er/ (with the two letters functioning separately). 


10.8 <are> 


THE MAIN SYSTEM 


Basic /ea/ 100% only word-final, e.g. care, pare 
phoneme 
THE REST 
pronounced 

Exception to <are> /ax/ <1% only in are when stressed (/a/ when 
main system unstressed) 
Oddities <arr> /a:/ only in bizarrery, carr, charr, parr 

<arre> /a:/ only in barre, bizarre (but /r/-linking occurs 


in bizarrery - see section 3.6) 


<arrh> /a:/ only in catarrh (but /r/-linking occurs in 
catarrhal - see section 3.6) 


2-phoneme (none) 

graphemes 

NOTE 

The only case where final <a, r, e> belong to separate graphemes is in Hare 
Krishna. 

10.9 <au> 


See Notes for dual percentages. 


THE MAIN SYSTEM 


Basic /d:/ 46%/80% e.g. aura, Sauce, autumn, Cause; 
phoneme word-final only in landau, Nassau 


THE REST 


Exceptions to <au> 
main system 


<au> 
<au> 
<au> 
<au> 
Oddities <augh> 
<aul> 
<aur> 
2-phoneme (none) 
graphemes 
NOTES 


pronounced 


/o/ 


/au/ 


/>:/ 


/>:/ 
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43%/1% only in Aussie, Australia, Austria, 
because (also increasingly pronounced, unusually, 
with stressed /a/), cauliflower, laurel, Laurence, 
Sausage, plus a few words also pronounced with 
/D:/: auction, austere, caustic, claustrophobia/c, 
hydraulic, (bacca) laureate. See Notes 


10%/17% only in aunt, draught, laugh(ter) 


1% only in a few more recent French loanwords, 
namely chauffeu-r/se, chauvinis-m/t, gauche, 
hauteur, mauve, saute, taupe 


<1% only in ablaut, Faustian, gaucho, gauleiter, 
glaucoma (also pronounced with /3:/), 
Sauerkraut (twice), umlaut and the Greek letter 
name tau; also in aural when pronounced 
/‘avral/ to distinguish it from oral /'d:ral/ 


only in gauge 


only in aught, caught, daughter, distraught, 
fraught, haughty, (Mc)Naught(on), naught, 
naughty, onslaught, slaughter, taught (and 
contrast draught, laugh(ter) where <au, gh> are 
separate graphemes) 


only in baulk, caulk, haulm. See also <al> under 
<a>, section 10.3 


only in bucentaur, centaur, dinosaur (and 
the names of various dinosaur species, e.g. 
pterosaur), minotaur 


If we follow Crystal (2012: 131-2), ‘more recent’ in terms of loanwords from 


French means after the Great Vowel Shift, which was complete by about AD 


1600. 


Where two percentages are shown above, the first is that given by Gontijo 


et al. (2003). Among these, the high percentage for /v/ is almost entirely 


due to because. In this case, specifically because they show the number of 
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occurrences of because in their database, Gontijo et al. provide enough 
information to re-calculate all the percentages for this grapheme omitting 
because. Where they differ from the originals, the revised percentages are 


shown second. 


There appear to be no cases where <a, u> are separate graphemes. 
For <au> as an elided vowel spelling in restaurant see section 6.10. 


10.10 <aw> 


THE MAIN SYSTEM 


Basic phoneme 

THE REST 

Exception to main system 
Oddity 

2-phoneme graphemes 


NOTE 


/a:/ 


<aw> 


<awe> 


(none) 


100% e.g. awful, crawl, dawdle, paw 

pronounced 

/>1/ <1% only in lawyer, sawyer 

/o:/ only in awe and derivatives which 
retain <e> 


Where the next letter is a vowel (other than in a suffix) or a consonant 
digraph, <a, w> belong to separate graphemes, e.g. in awake, award, 


aware, awry, awhile, caraway, megawatt. 


10.11 <ay> 


THE MAIN SYSTEM 


Basic phoneme 


THE REST 


Exceptions to main 
system 


/et/ 


<ay> 


91% e.g. day. See Note 
pronounced 
/ix/ 8% only finally and only in quay 


and compounds of day: birthday, 


<ay> 


Oddities <aye> 


<aye> 


<ayer> 


<ayor> 


2-phoneme graphemes (none) 


10.12 <e> 
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/e/ 
/e1/ 


/at/ 


/ea/ 


/ea/ 


holiday, Sunday, yesterday, etc., 
except heyday, midday, nowadays, 
today, workaday, which retain /e1/, 
as does holidaying 


<1% only in says 


only in aye - the usual pronunciation 
for the meaning ‘always, still’ 


only in aye, aye-aye. Aye is always 
pronounced /at/ when it means 
‘yes’, sometimes also when it means 
‘always, still’ 


only in prayer pronounced /prea/ 
(‘religious formula’; also pronounced 
/‘pretja/, ‘one who prays’) 


only in mayor and derivitives (but 
there is /r/-linking in mayoral, 
mayoress - see section 3.6) 


N.B. <ea, ear, ed, ee, e.e, eer, er, ere, ew> have separate entries. 


THE MAIN SYSTEM 


For all these categories see Notes. 


Basic phoneme /e/ 


Other phonemes /1/ 


47% 


39% 


e.g. bed, invent. Regular when 
it is the only or last vowel 
letter and is followed by at 
least one consonant letter, 

in earlier positions before 
consonant clusters, in stressed 
<ex->, and before <-Cic(al)> 


mainly when unstressed, e.g. 
corset; when stressed, only in 
England, English, pretty and 
Cecily pronounced /'ststliz/ 
and therefore as a homophone 
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/2/ 


THE REST 


Exceptions to main system 


<e> 


<e> 


8% 


5% 


pronounced 


/o/ 


/e1/ 


of Sicily (Cecily is also 
pronounced /'sesili:/). Regular 
in some suffixes 


e.g. be, decent, ether, psyche. 
Regular with <e>-deletion, 
word-finally, and before or in 
certain endings 


regular when unstressed, e.g. 
the (unstressed and before a 
consonant phoneme), artery 


1% in total 


in about 22 more recent French 
loanwords, e.g. (the relevant <e>’s 
are in caps) ambiEnce, cliEntele, 
denouemEnt, détEnte, divertissemEnt, 
Embonpoint, Embouchure, En (suite), 
Enceinte pronounced /pn'sent/ (also 
pronounced /en'setnt/), Enclave 
pronounced /'pnkletv/ (more often 
pronounced /'enkletv/), Encore, 
Ennui, EnsEmble, EntEnte, Entourage, 
Entracte, Entrepreneur, Entree, 
Envelope pronounced /'pnvalaup/ 
(also pronounced /'envalaup/), gEnre, 
rapprochemEnt, rEntier 


in words where <e> is the only vowel 
letter, only in thegn. Otherwise only 
in about 65 more recent loanwords 
mainly from French where French 
spelling has <é>, namely 

(in non-final positions) debris, 
debut, decor, eclair, ecru, elan, 
ingenu, precis; first <e> in debacle, 
debutante, decalage, decolletage, 
denouement, detente, elite, ingenue, 
menage, regime, seance, 
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(Greek) heter-/hom-ogeneity 
pronounced /hetar-/hom- 
audzr'nejjrti: / (usually pronounced 
/hetar-/hom-avudsr'nisjrti: /), 
(Hawaian) ukulele and (Turkish) meze; 
(word-finally) (French) abbe, attache, 
blase, cafe (also pronounced with 
/i:/), canape (also pronounced with 
/it/, hence the invitation | once 
received to a party with ‘wine and 
canopies’), cliche, communique, 
conge, consomme, diamante, fiance, 
flambe, frappe, glace, habitue, 
macrame, manque, outre, pate 
(‘paste’), retrousse, risque, rose 
(‘pink wine’), roue, saute, soigne, 
souffle, touche, (Amerindian/Spanish) 
abalone, (Greek) agape (/*gapel/, 
‘love feast’), (Italian) biennale, finale, 
(Spanish/Nahuatl) guacamole, 
(japanese) anime, kamikaze and 
(Mexican Spanish) tamale; final <e> 
in (French) emigre, expose (‘report of 
scandal’), naivete, protege, recherche, 
resume (‘c.v.’), retrousse, (KiSwahili/ 
Spanish) dengue and (Turkish) meze. 
There is an increasing tendency to 
spell the French loanwords in this list, 
within English text, with <é> 


Oddities <e’er> /ea/ only in e’er, ne’er, where’erand a 
few other archaic contracted forms. 
See section A.9 in Appendix A 


For ‘<i> before <e> except after 
<c>’ see section 6.1 


<ei> /ix/ 69% of pronunciations for <ei> only 
medial and only in caffeine, casein, 
ceiling, codeine, conceit, conceive, 
counterfeit pronounced /‘kauntefi:t/ 
(also pronounced /'kauntefit/), 
cuneiform, deceit, deceive, heinous 
pronounced /'hi:nas/ (also 
pronounced /'hei:nas/), inveigle, 
perceive, plebeian, protein, receipt, 
receive, seize 
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<ei> 


<ei> 


<ei> 


<ei> 


<ei> 
<ei> 


<eigh> 


<eigh> 


<eir> 


<eir> 


<eo> 


/at/ 


/a/ 


/e1/ 


/1/ 


/et/ 


/at/ 


/ea/ 


/13/ 
/a/ 


23% of pronunciations for <ei> only 
in deictic, deixis, eider(down), eidetic, 
eirenic, either, Fahrenheit, feisty, 
gneiss, heist, kaleidoscope, meiosis, 
neither, poltergeist, seismic and 
derivatives, stein 


7% of pronunciations for <ei> only in 
foreign (which must therefore have 
been very frequent in Gontijo et al.’s 
(2003) database) 


All other pronunciations of <ei> 
amount to <1% in total 


only in about 15 words, namely 
abseil, apartheid, beige, deign, feign, 
feint, heinous pronounced /'hetnas/ 
(also pronounced /‘hi:nas/), lei (only 
example in word-final position), 
obeisance, reign, rein, reindeer, seine, 
sheikh, skein, surveillance, veil, vein 


only in counterfeit pronounced 
/‘kauntefit/ (also pronounced 
/'kaunteafi:t/), forfeit, sovereign, 
surfeit 


only in heifer, leisure, seigneur 
only in reveille 


89% of pronunciations for <eigh> 
only in eight, freight, heigh, inveigh, 
neigh, neighbour, sleigh, weigh, 
weight 


11% of pronunciations for <eigh> 
only in height, sleight 


only in theirs) and therefore virtually 
100% 


only in weir, weird 


only in bludgeon, curmudgeon, 
dudgeon, dungeon, gudgeon, 
luncheon, puncheon, (e)scutcheon, 
smidgeon, sturgeon, surgeon, 
truncheon, widgeon 


<eo> 


<eo> 


<eo> 


<es> 


<et> 


<eu> 


<eu> 


<eu> 


<eur> 


<eur> 
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/ux/ 


/a/ 


only in Geoffrey), jeopardy, Leonard, 
leopard 


only in feoffee, feoffment , people 
only in Yeo, yeoman, Yeovil 
only in demesne 


only word-final and only in about 
20 more recent French loanwords, 
namely ballet, beret, bidet, bouquet, 
buffet (‘food’), cabaret, cabriolet, 
cachet, cassoulet, chalet, crochet, 
croquet, duvet, gilet, gourmet, 
parquet, piquet, ricochet, sachet, 
so(u) briquet, sorbet, tourniquet, 
valet pronounced /'veletr/ (also 
pronounced /'veltt/). /t/ surfaces 
in balletic, parquetry, valeting - see 
section 7.2. In these and all other 
cases <e, t> are separate graphemes 
- for examples see Notes 


only in rheum(ati-c/sm), sleuth, plus 
adieu, lieu (also pronounced /lu:/), 
purlieu if pronounced with /-jur/, in 
which case <i> is pronounced /j/ 


only in chauffeuse, coiffeuse, 
masseuse, milieu 


only in pasteurise pronounced 
/‘parstfaraiz/ (also pronounced 
/‘parstjara1z/ 


non-finally, only in secateurs; 
otherwise only word-final and only 
in about 12 more recent loanwords 
of French origin, e.g. chauffeur 

(if stressed on <eur>), coiffeur, 
connoisseur, entrepreneur, hauteur, 
masseur, poseur, provocateur, 
raconteur, repetiteur, restaurateur, 
seigneur and a few other rare words 


only in amateur, chauffeur (if 
stressed on <au>), grandeur. 
/r/-linking occurs in amateurish - 
see section 3.6 
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<eur> /ua/ only in pleurisy, the <r> is also 
pronounced /r/. For dual-functioning 
see section 7.1. See section 5.6.5 for 
the increasing replacement of /ua/ 
by /3:/ 


<ey> /ix/ except in geyser pronounced /'gi:za/, 
only final and only in abbey, alley, 
attorney, baloney, barley, blarney, 
blimey, cagey, chimney, chutney, 
cockney, comfrey, coney, donkey, 
dopey, flunkey, fogey, galley, gooey, 
hackney, hockey, homey, honey, 
jersey, jockey, journey, key, kidney, 
lackey, malarkey, matey, medley, 
money, monkey, motley, nosey, 
palfrey, parley, parsley, pokey, pulley, 
storey, tourney, turkey, valley, volley 


<ey> /et/ never initial; medially, only in 
abeyance, heyday, word-finally, 
only in bey, convey, fey, grey, hey, 
lamprey, obey, osprey, prey, purvey, 
survey, they, whey 


<ey> /at/ only in geyser pronounced /'ga1za/ 
(usually pronounced /'gi:za/) 
<eye> /at/ only in eye 


<eyr> /ta/ only in eyrie ; the <r> is also 
pronounced /r/. For dual-functioning 
see section 7.1 


<ey’re> /ea/ only in they’re. See section A.9 in 
Appendix A 
<e’re> /ta/ only in we’re. See section A.9 in 
Appendix A 
<ez> /et/ only in laissez-faire, pince-nez, 
rendezvous 
2-phoneme graphemes <eu> as only in various words and names 
2-phoneme _ of Greek origin, e.g. eucalyptus, 
sequence eucharist, eudaemonic, eugenic, eulogy, 
/jux/ eunuch, euphemism, euphorbia, 


euphoria, eurhythmic, euthanasia, 
leukaemia, neural, neurone, neurosis, 
Odysseus, Pentateuch, Perseus, 
pneumatic, pneumonia 
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and other words derived from 

Greek Trvebua pneuma (‘breath’) or 
TIvevuWwv pneumon (‘lung’), pseudo) 
and all its derivatives, therapeutic, 
Theseus, zeugma, plus (non-Greek) 
deuce, euchre, Eustachian, feu, 
feud(al), neuter, neutr-al/on, teutonic 
and some other very rare words 


<eu> as only in aneurism, pasteurise 
2-phoneme pronounced /'pa:stjaraiz/ (also 
sequence pronounced /'pa:stfaratz/) 
/je/ 

<eur> as only in eureka, Europe and 
2-phoneme_ derivatives (where the <r> is 
sequence also pronounced /r/ - for dual- 
/jue/ functioning see section 7.1) and 


liqueur pronounced /It'kjua/ 


NOTES 


If we follow Crystal (2012: 131-2), ‘more recent’ in terms of loanwords from 
French means after the Great Vowel Shift, which was complete by about AD 
1600. 
Except in the cases noted in the Oddities, in <eo, et, ez> the <e> isa 
separate grapheme - cf. especially someone. 
<e, i> are separate graphemes pronounced /i:, 1/ (with an intervening 
/j/-glide) in albeit, atheis-m/t(id, dei-fy/sm/st, hetero/homo-geneity, 
nucleic, pantheism, reify, reinforce, reinstate. 
<e, u> are separate graphemes pronounced /i:, 3/ (again with an 
intervening /j/-glide) in coleus, linoleum, mausoleum, museum, nucleus, 
petroleum. 
For many examples of medial <e, o> as separate graphemes see below. 
Percentages for <eo, eu, eur, ey> are not worth giving because so few 
words are involved. 
For instances of <e> as an elided vowel see section 6.10. 
The default pronunciation of <e> as a single-letter grapheme is /e/, but 
here are some categories for guidance: 
regular before geminate and doubled consonant spellings, e.g. ebb, 
beck, speckled, cheddar, hedge(n), ineffable, egg, trekkie, bell, bellow, 
biennial, berry, blessed /'blestd/, stress, wretch(ed), sett, settle, 
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embezzle. Extension: all the words ending <-ette>. Only exception: 
retch pronounced /ri:t{/ 
regular in other words where <e> is the only vowel letter and is 
followed by at least one consonant letter, e.g. bed, phlegm, trek. See 
section 11.3 for a teaching rule relevant to ..VC monosyllables 
regular before consonant clusters in words with at least one earlier 
vowel letter separated from the relevant <e> by at least one 
consonant letter, e.g. accept, bedeck, except, inflect, reject, present 
(verb), prevent, repent, subject /sab'dgekt/ (verb, with stress on <e>; 
the noun is pronounced /'sabdgrkt/). Extension: 3 words with <eCe> 
where <e.e> is not pronounced /i:, e1/ (for words where <e.e> is so 
pronounced see section 10.17): allege, clientele, cortege 
mostly when <e> is followed by more than one consonant letter and 
there is at least one later vowel letter, e.g. better, bevvy, enter, freckle, 
pendulum, phlegmatic, splendid, terrible, terrify 
when followed by a single consonant letter and the endings <-ic(al)>, 
e.g. academic, arithmétic (adjective), arithmetical, ascetic, athletic, 
atmospheric, genetic, heretical, parenthetical, pathetic. Extension: 
ethic-al/s. Only exceptions: acetic (which is thus differentiated from 
ascetic), emic, graphemic, phonemic, scenic, with /i:/, arithmetic 
(noun), chdleric, climacteric, héretic, with the relevant <e> pronounced 
/a/. The stress always falls on the syllable spelt with the relevant <e> 
except in the four words just shown with different stresses 
in a few words (some very frequent) before <ver>, e.g. beverage, 
clever, ever, every, never, reverend, several. However, the exceptions 
are more numerous: cantilever, fever, leverage) with /i:/; persevere, 
revere, revers-e/al, severe with /1/ 
in a further ragbag of non-final occurrences, e.g. first <e> in celery, 
deference, element, emery, excellent, exile, exit, levee, machete, 
penance, preference, present (noun, adjective), president, reference, 
relevant, separate, seven, tether; second <e> in decrepit, presidential, 
replenish, third <e> in deferential, preferential, referential; also bevy, 
credit, debit, discretion, edit, fetish, heron, inherit, intrepid, lemon, 
leper, levy, medal, melon, merit, metal, pedal, pedant, perish, relish, 
special, tenant, tenon, tepid, very, xenon. 

The task then is to define the circumstances in which <e> has other 

pronunciations. These can be summarised as follows. 
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Word-final <e> is mainly ‘silent’, i.e. part of a digraph (split or not), 
trigraph or four-letter grapheme. It is ‘pronounced’: 

1) as /et/, onlyin the 42 words listed under the exception ‘<e> pronounced 
/e1/’ above 

2) as /ix/ in be, he, me, she, the (when stressed), we, ye; aborigine, acme, 
acne, adobe, agave, anemone, apostrophe, bocce, catastrophe, coyote, 
dilettante, epitome, extempore, facsimile, (bona) fide, forte, furore, hebe, 
hyperbole, karate, machete, menarche, minke, nepenthe, oche, posse, 
psyche, recce, recipe, reveille, sesame, simile, stele, strophe, tagliatelle, 
tsetse, ukulele, vigilante. 

Non-final <e> is pronounced /i:/ in: 

1) hundreds of words where the final <e> of <e.e> has been deleted 
before a suffix beginning with a vowel letter, e.g. competing, schematic 
- see sections 6.3 and especially 6.4 

2) anumber of words where <e> is followed by a single consonant letter 
other than <r> and then by 

any of <eo, ia, io, iou, iu> followed word-finally by a single consonant 
letter or none, e.g. chameleon (first <e>), meteor (first <e>); sepia; 
comedian, congenial, genial, Grecian, remedial (second <e>); cohesion, 
completion, lesion and many more words ending in <-esion, -etion>, 
senior; egregious (second <e>), facetious, ingenious, specious, tedious; 
genius, magnesium, medium, tedium 

<-ien-ce/cy/t>, e.g. obedience, expediency (second <e>), leniency, 
convenient, expedient, ingredient. 

In all these words (and the first two exceptions listed next) the <e> in 

question is stressed. Exceptions: discretion, special with /e/, dandelion, 

denial with /1/, elegiac with second <e> pronounced /92/. 

3) avery few words when unstressed before word-final <o(n/r)>: galleon, 
Odeon, video, second <e> in chameleon, melodeon, meteor (all with 
automatic intervening /j/-glide) 

4) the ending <-eous> pronounced /itjas/, e.g. aqueous, beauteous, 
courteous, (sub)cutaneous, erroneous, gaseous pronounced /'gesi:jas/, 
hideous, instantaneous, nauseous pronounced /'nd:zitjas/, simultaneous 
and about 70 other words. But N.B. there are many words ending 
in <-eous> where the <e> is part of a digraph with the preceding 
letter, e.g. advantageous, gaseous pronounced /'getfas/, gorgeous, 
nauseous pronounced /'nd:3as/, righteous, siliceous and a set of 
words in <-aceous> pronounced /'etfas/, e.g. cretaceous, curvaceous, 
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5) 


6) 


7) 


8) 


herbaceous, sebaceous and about 100 others, mostly scientific and all 
very rare 

a number of words when stressed before a single consonant letter and 
word-final <a, o>, e.g. beta, edema, ego pronounced /'i:gau/ (also 
pronounced /'egau/), emphysema, eta, hyena, magneto, schema, theta, 
torpedo, tuxedo, verbena, veto, etc. 

plurals of a few nouns with singular ending <-is> pronounced 
/1s/ and plural ending <-es> pronounced /i:z/, e.g. (Greek) analyses 
(/a'nelisizz/, the singular verb of the same spelling being pronounced 
/‘wnealatziz/), apotheoses, axes, bases (/‘weksi:z, 'bersi:z/, plurals of 
axis, basis; axes, bases as the plurals of axe, base are pronounced 
(regularly) /‘'sks1z, 'bers1z/), crises, diagnoses, emphases, exegeses, 
nemeses, oases, periphrases, synopses, (anti/hypo/meta/syn-)theses, 
(Latin) amanuenses, testes, plus (Greek singulars) diabetes, herpes, 
litotes, pyrites, (a stray Greek plural with singular in <-s>) Cyclopes, 
and (other Latin plurals) appendices, cicatrices, faeces, interstices, 
mores, Pisces 

the stressed prefixes <de-, e-, pre-, re-> pronounced /dir:-, i:-, pri:-, 
rir-/ in, e.g., dethrone, egress, preschool, rephrase 

alveolar, apotheosis, camellia, cathedral, cedar, choreograph, demon, 
ethos, femur, genus, harem, legal, lemur, leotard, lethal, mimeograph, 
negus, neon, osteopath, pecan, penal, penis, peony, pleonasm, rebus, 
regal, renal, retch (pronounced /ri:t{/ (also pronounced /ret{/), secant, 
theory, thesis (but not its compounds), venal, venial, etc., (first <e> in) 
abbreviate, appreciable, cotoneaster /ka'tauni:jesta/, creosote, decent, 
diabetes, egret, ether, febrile, feline, geodetic, heliotrope, immediate, 
inebriated, leonine, mediocre, meter, metre, recent, regent, etc. 


Carney would place all the words in categories 3 and 4, and those in category 


8 where <e> is followed by a vowel grapheme pronounced /a/, under /1a/. 


The only words in which <e> is pronounced /1/ in stressed syllables are 


England, English, pretty and Cecily pronounced /'stsili:/ and therefore as a 


homophone of Sicily (Cecily is also pronounced /'sestli:/). Categories where 


/1/ is the regular pronunciation of unstressed <e> are: 


the unstressed prefixes <be-, de-, e-, ex-, pre-, re-> pronounced 
/b1, di, 1, 1ks/1gz, pri, r1/ in, e.g., before, beholden, decline, deliver, 
effective, efficient, extreme, examine, precede, predict, regale, reject 

some occurrences of the ending <-ed> - see section 10.15 
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the endings <-efy, -efied> pronounced /1fa1(d)/, which occur in just 
four words: liquefy, putrefy, rarefied, stupefy 
the ending <-ety> pronounced /iti:/ in anxiety, dubiety, entirety, 
gaiety, moiety, naivety, nicety, notoriety, (im)piety, (im) propriety, 
sobriety, society, surety, variety 
the noun plural and third person singular present tense verb endings 
spelt <-es> and pronounced /1z/ after <c, ch, g, s, sh, Z> pronounced 
variously /s, z, J, 3, tf, d3/ - see the entries for those consonants in 
sections 3.6.6, 3.6.8, 3.7.3, 3.7.4, 3.6.2, 3.6.4 respectively. Exceptions: 
plurals of (Greek) nouns, etc., listed above 
the unstressed noun/adjective endings <-ess, -less, -let, -ness> 
pronounced /tIs, Irs, lrt, nis/, e.g. goddess, listless, booklet, madness 
the superlative adjective ending <-est>, e.g. biggest, grandest 
the archaic second and third person singular verb endings <-est, 
-eth>, e.g. gavest, goeth 
mainly before final <t>, e.g. in ashet, brisket, budget, buffet (‘strike’), 
corset, dulcet, facet, fillet, gannet, gullet, nugget, plummet, punnet, 
russet, secret, tuffet, valet (also pronounced with /e1/ and no /t/) 
and about 150 other words. For final <et> pronounced /e1/ see the 
Oddities. 
There is also a ragbag of other words with non-final <e> pronounced 
/1/, e.g. allegation, employ, forest, hallelujah, integral (when pronounced 
/‘Intigral/; also pronounced /in'tegral/), kitchen, mannequin, regalia, 
subject (noun /'sabdgikt/, with stress on <u>; the verb is pronounced 
/sab'dgekt/), vinegar, women; first <e> in anecdote, antelope, barometer 
and all the instruments ending in <-ometer> (but not kilometer or other 
compounds of meter), celebrity, consecrate, diocese, eccentric, ellipse, 
elope, enamel, integrate, negate, neglect, sequential, second <e> in elegant, 
elephant, peregrine, and many others. 

Examples of non-final <e> pronounced /a/ include every unstressed 
final <-en> (e.g. alien) except in women (/'w1imin/), plus artery, bolero 
(/‘bolarau/, ‘garment’), soviet, first <e> in coterie; second <e> in elevate, 
the first <e> in the ending <-ence> in, e.g., audience, conscience, 
convenience, ebullience, experience, omniscience, obedience, prurience, 
resilience, salience, science; the <e> in the endings <-ency, -ent> in, e.g., 
expediency, leniency, absent, (in)clement, convenient, ebullient, expedient, 
incipient, lenient, orient (noun), omniscient, obedient, prescient, present 
(noun/adjective), prurient, resident, resilient, salient, sentient, subservient, 
transient; also, in nouns ending <-ment>, e.g. complement, compliment, 
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document, element (note the second <e> too), experiment, ferment, 


fragment, implement, increment, instrument - on this last group see also 
section 6.8. 


10.13 <ea> 


N.B. <ear> has a separate entry. 


THE MAIN SYSTEM 


Basic phoneme /i:/ 73% e.g. beach 
THE REST 
pronounced 
Exceptions to <ea> = /e/ 21% In about 60 words, namely: Beaconsfield; 
main system treacher-ous/y, bread, breadth, dead, dread, 


(a)head, lead (the metal, plus derivatives 
leading, leaded), meadow, read (past tense 

and participle), Reading (Berkshire, in first map 
(1611) spelt Redding), (al)ready, spread, (in) 
stead, steadfast, steady, thread, tread(le); deaf, 
breakfast; dealt, health, jealous, realm, stealth, 
wealth, zealous, zealot; dreamt, seamstress, 
cleanly (adjective, plus derivative cleanliness), 
cleanse, leant, meant; leapt, weapon, (a)breast; 
peasant, pheasant, pleasant; measure, pleasure, 
treasure; sweat, threat(en); breath, death; 
feather, heather, leather, weather, endeavour, 
heaven, heavy, leaven and other derivatives not 


listed 
<ea>  /et/ 6% only in break, great, steak, yea, Yeat(e)s 
Oddities <eah> /ea/ only in yeah 
<eau> /b/ only in bureaucracy, bureaucratise 
<eau> /a/ only in bureaucrat(ic 
<eau> /au/ only word-final and only in bandeau, beau, 


bureau, chateau, flambeau, gateau, plateau, 
portmanteau, rondeau, tableau, trousseau and 
a few other very rare words. For the plurals of 
these words see /z/, section 3.6.7, and <x>, 
section 9.42 
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2-phoneme <eau> as only in beauty and derivatives 
grapheme 2-phoneme 

sequence 

/jux/ 
NOTES 


The roughly 20 words listed above with <ead> pronounced /ed/ contrast 
with about 6 pronounced /i:d/: bead, knead, lead (verb), mead, plead, read 
(present tense). The <-ead> pronounced /ed/ group is one of only five 
cases where the pronunciation of a phonogram/rime is more predictable as 
a unit than from the correspondences of the separate graphemes, and there 
are enough instances to make the rule worth teaching; see section A.7 in 
Appendix A. 

<e, a> are separate graphemes pronounced /i:, 1/ only in lineage; /i:, 3/ 
in area, azalea, cereal, cornea, creativity, European, fealty, idea, Jacobean, 
(bacca)laureate, miscreant, nausea, panacea, theatre, urea; /it, «/ in 
beatitude, caveat, cotoneaster, deactivate, genealogy, meander, oleander, 
preamble, react, realign, /it, e1/ in create, creation, delineate, nauseate, 
reagent. In all these cases there is an automatic intervening /j/-glide. 

<e, a> are/belong to separate graphemes also in a set of words in which 
<e> has not been deleted before suffixes beginning with a vowel letter, in 
order to mark <c, g> as pronounced /s, dg/ and not /k, g/, e.g. noticeable, 
changeable - for more detail see section 6.4. 


10.14 <ear> 


THE MAIN SYSTEM 


Basic phoneme /1a/ 67% medially only in afeard, arrears, 
beard, and (with <r> also a grapheme 
in its own right pronounced /r/ - for 
dual-functioning see section 7.1) 
bleary, weary; otherwise only word- 
final and only in appear, arrear, blear, 
clear, dear, drear, ear, fear, gear, 
hear, near, rear, sear, Shear, smear, 
Spear, tear (‘moisture from eye’), year 
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Other phoneme /ea/ 


THE REST 


Exceptions to <ear> 
main system 


<ear> 
Oddities (none) 
2-phoneme (none) 
graphemes 
NOTES 


1% 


pronounced 


/31/ 


/ax/ 


only word-final and only in (for(e)-) 
bear, pear, swear, tear (‘rip’), wear 


29% never word-final, and only in dearth, 
earl, early, earn, earnest, earth, heard, 
hearse, learn, pearl, rehearse, (re)search, 
yearn 


4% only in hearken (also spelt, more regularly, 
harken), heart, hearth 


All the words with final <ear> allow /r/-linking - see section 3.6. 
Despite the percentage for <ear> pronounced /3:/ | have not promoted 


this correspondence to the main system because it occurs in so few words 


(though some have very high frequency). 

<e, ar> are separate graphemes pronounced /i:, 3/ in cochlear, linear, 
nuclear, /it, a:/ in rearm; (with <a, r> as separate graphemes) /i:, a, r/ in 
rearrange. |n all these cases there is an automatic intervening /j/-glide. 


10.15 <ed> 


THE MAIN SYSTEM 


Basic phoneme /d/ 


Other phoneme /t/ 


62% 


38% 


in past tense and participle endings 
of regular verbs whose stems end in 
a vowel letter or in a consonant letter 
other than <d> 


in past tense and participle endings 
of regular verbs whose stems end ina 
consonant letter other than <t> 
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THE REST 


(None). 


NOTES 


Where the stem of a regular verb ends in <(d)d, (t)t> pronounced /d, t/ the 

<-ed> ending is pronounced /1d/, e.g. added, decided, matted, ousted. This 

also applies in: 
a few adjectives which are derived from or resemble past participles 
but have /1d/ rather than the expected /d, t/, but often with a 
different meaning, e.g. accursed, aged (/‘e1dgid/ ‘elderly’ vs /e1d3d/ 
‘having ... years’), beloved (/b1'lavid/ ‘the loved one’ vs /br'lavd/ 
‘adored’), blessed (/'blestd/ ‘holy’ vs /blest/ ‘consecrated’), cragged, 
crooked (/‘krukid/ ‘untrustworthy’ vs /krukt/ ‘at an angle’), Crutched 
(Friars), cursed (/'k3:std/ ‘damnable’ vs /k3:st/ ‘swore badly/put a 
hex on’), cussed (/'kasid/ ‘stubborn’ vs /kast/ ‘swore mildly’), deuced, 
dogged (/'dogid/ ‘persistent’ vs /dogd/ ‘followed’), fixed (/'fiks1d/ 
‘persistent’ vs /fikst/ ‘mended’), horned (owl), jagged (/'dgegid/ ‘with 
sharp points’ vs /dgegd/ past tense of jag), learned (/'I3:n1d/ ‘wise’ 
vs /I3:nd/ regular past tense of learn), (bow/one/three-)legged, naked, 
ragged (/'regid/ ‘torn, exhausted’ vs /regd/ past tense of rag), 
rugged, sacred, supposed (/sa'pauzid/ ‘apparent’ vs (/sa'pauzd/ past 
tense of suppose), wicked, wretched. In (ac)cursed, blessed, crooked, 
Crutched, cussed, deuced, fixed, wretched, not only does the /1/ 
surface (see section 7.2) but the /t/ voices to /d/ 
the past participle verb ending <-ed> pronounced /i1d/ before 
adverbial <-ly>, e.g. advisedly, allegedly, assuredly, barefacedly, 
composedly, confusedly, deservedly, determinedly, fixedly, markedly, 
relaxedly, (un)reservedly, supposedly, unabashedly, unashamedly, 
undisguisedly, unrestrainedly. Again, in barefacedly, fixedly, markedly, 
relaxedly, not only does the /1/ surface (see section 7.2) but the /t/ 
voices to /d/ 
the <ed> element in a very few nouns in <-ness> formed from past 
participles, e.g. determinedness, preparedness. |In preparedness not 
only does the /1/ surface (see section 7.2) but also /r/-linking occurs 
(see section 3.6) and the <r> is both part of the grapheme <are> 
pronounced /ea/ and a grapheme in its own right pronounced /r/. For 
dual-functioning see section 7.1. 
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Given the phonological contexts, <ed> is 100% predictable. 


Outside the verb endings listed, <e, d> are always separate graphemes, 
e.g. in bed, biped, bred, led, quadruped, shed. 


10.16 <ee> 


N.B. <e.e, eer> have separate entries. 


THE MAIN SYSTEM 


Basic /ix/ 100% e.g. beech, free, seen 
phoneme 
THE REST 
pronounced 
Exceptions to <1% in total 


main system 


<ee> /et/ only word-final and only in about 13 words 
where French spelling has <ée>, namely 
corvee, dragee (‘sugar-coated sweet’ 
pronounced /'dra:zer/; also pronounced 
/‘dre1dgi:/), entree, epee, fiancee, levee 
(‘reception or assembly’, also pronounced 
with /i:/), matinee, melee, nee, negligee, puree, 
Soiree, toupee. There is a growing tendency to 
spell these words in English with <ée> 


<ee> /1/ only in been when unstressed, breeches, 
cheerio /btn,'brit{iz, tftris'jau/ 


<ee> /ux/ only in leeward pronounced /'lu:wad/ (also 
pronounced /'li:wad/) 


Oddities (none) 
2-phoneme (none) 
graphemes 

NOTE 


<e, e> are separate graphemes only in afew unusual suffixed forms, e.g. freer, 
freest, weer, weest (comparative and superlative forms of the adjectives free, 
wee), freest, freeth, seest, seeth (/'frixjist, ‘frisj1O, 'si:jrst, ‘sixj10/, archaic 
second and third person singular present tense forms of the verbs free, see), 
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sightseer /'sattsi:ja/ (for more detail see section 6.4). There might then be 
a barely perceptible difference in pronunciation between two words spelt 
seer. disyllabic /'si:ja/ ‘person who sees’ vs monosyllabic /s1a/ ‘person with 
second sight’. 


10.17 <e.e> 


Occurs only where the second <e> is word-final. 
See Note for all categories and for how this split digraph is defined, and 
see section 11.4 for a teaching rule relevant to all split digraphs except <y.e>. 


THE MAIN SYSTEM 


Basic phoneme /ix/ 100% e.g. effete, grapheme, phoneme, 
scene, swede 


Other phoneme Jeri <1% only in crepe, fete, 
renege, suede, Therese 
/krezp, fert, r1'ne1g, sweid, ta're1z/ 


THE REST 

Exceptions to main system strictly speaking, none, but see Note 
Oddities (none) 

2-phoneme graphemes (none) 

NOTE 


The split digraph <e.e> is defined as covering words where the word-final 
<e> is separated from the leading <e> by one consonant letter other than 
<r, W, X, y> and the leading <e> is not preceded by a vowel letter and the 
digraph is pronounced either /i:/ or /e1/. Unlike <a.e>, no extensions are 
needed. The definition covers both words where the intervening consonant 
letter is an independent grapheme and words where the <e> is also part of 
a digraph <ce, ge, ve> - see sections 3.7.4, 3.7.6-7 and 3.8.4, and section 
7.1 for dual-functioning. Exceptions where the leading <e> is a separate 
grapheme and the word-final <e> only forms a digraph with the intervening 
consonant letter: allege, annexe, clientele, cortege with the penultimate <e> 
pronounced /e/ (cf. also creche), college, privilege, sacrilege, sortilege with 
the penultimate <e> pronounced /1/. There are very few English words 
ending <-ege>, and the five just mentioned are most of them, apart froma 
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few very obscure and obsolete terms, and protegé, which is increasingly spelt 
like that, with a French acute accent and the final <e> always pronounced 
separately: /'protazer/. The only other words in which <e, e> separated 
by a single consonant letter are separate graphemes appear to be hebe, 
machete, naivete, stele, ukulele. See also section A.6 in Appendix A. 


10.18 <eer> 


THE MAIN SYSTEM 


Only /1a/ 100% except in eerie, where <r> is also 

phoneme pronounced /r/ (for dual-functioning 
see section 7.1), only word-final, e.g. 
beer. Many words with this ending allow 
/r/-linking - see section 3.6 


NOTE 


The only words in which <e, er> are separate graphemes appear to be freer, 
weer (comparatives of free, wee). 


10.19 <er> 


N.B. <ere> has a separate entry. 


THE MAIN SYSTEM 


Basic /3:1/ 24% regular medially when stressed before 

phoneme a consonant letter, e.g. berth, exert, 
herd; also word-finally when stressed, 
e.g. aver, defer, deter, her, infer, inter, 
prefer, refer, transfer 


Other /a/ 65% regular word-finally when unstressed, 

phonemes e.g. other, patter; also in prefixes hyper-, 
inter-, per-, super- when not stressed on 
<er> 


/ta/ <1% never word-final; initially, only in era; 
regular medially before a vowel letter 
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when stressed, e.g. anterior, arterial, 
bacteria, cafeteria, criteri-a/on, 
deteriorate, diphtheria, experience, 
funereal, hero, imperial, inferior, 
material, mysterious, period, posterior, 
series, superior, ulterior, wisteria. \n all 
these words the <r> is both part of the 
digraph <er> pronounced /1a/ anda 
grapheme in its own right pronounced 
/r/ - for dual-functioning see section 7.1 
- and the <er> is stressed. Also see Notes 


THE REST 


pronounced 


Exceptions to <er> /ea/ 9% only in bolero (‘dance’), concierge, 

main system recherche, scherzo, sombrero. \n bolero, 
sombrero the <r> is both part of <er> 
spelling /ea/ and a grapheme in its own right 
pronounced /r/. This is also true of a few 
suffixed forms of words in the next section 
with <-ere> pronounced /ea/, e.g. compering. 
For dual-functioning see section 7.1 


<er> = /et/ 1% only word-final and only in a few French 
loanwords, namely atelier, croupier, dossier 
pronounced /'dpsi:jer/ (also pronounced 
/'‘dosisja/), foyer pronounced /'fwarjet, ‘foijer/ 
(also pronounced /'fot1ja/), metier, rentier 


<er> /ax/ <1% only in Berkeley, Berkshire, Cherwell, clerk, 
derby, Ker pronounced /ka:/ (also pronounced 
/k3:/), Sergeant 


Oddities <err> = /3/ in stem words only in err, but frequent in 
consonant-doubling before suffixes, e.g. 
preferred. All other occurrences of <e, rr> 
consist of two graphemes pronounced /e, r/, 
e.g, terrible, terrier 


<erre> /ea/ only in parterre 


2-phoneme (none) 
graphemes 
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NOTES 


Words ending <er> and the prefixes hyper-, inter-, per-, super- permit 
/r/-linking (see section 3.6) before following words/stems beginning with 
a vowel phoneme, e.g. dearer and dearer /‘diararan'diara/, hyperactive, 
interactive, peroxide, supererogatory. 

In the case of medial <er> pronounced /1a/ plus /r/-linking there are 
also a few instances arising from suffixation of words belonging to the next 
section, e.g. adherents, coherence, interfering, interferon, perseverance. 
However, in other suffixed forms from words in the next section the 
pronunciation of the <e> changes and, although /r/-linking occurs, 
the <r> is a single-function grapheme pronounced /r/, e.g. spherical, 
atmospheric, austerity, reverence, severity, (in)sincerity, this is also true of 
errant, derived from err. 


10.20 <ere> 


For absence of percentages see Note. 


THE MAIN SYSTEM 


Basic phoneme /13/ regular word-finally, e.g. here, mere, sere, 
sphere, ad/co-here, atmosphere, austere, 
belvedere, cashmere, interfere, persevere, 
revere, severe, (in)sincere. In hereon 
/r/-linking - see section 3.6 - occurs without 
<e>-deletion (which would produce heron) 


THE REST 

pronounced 
Exceptions to main <ere> /ea/ only word-final and only in ere, there, 
system where and a few polysyllabic words 


of French origin, namely ampere, 
brassiere, cafetiere, commere, compere, 
confrere, misere, premiere. /r/-linking 
- see section 3.6 - occurs in compering, 
wherever, etc.; also in thereupon 
without <e>-deletion 


<ere> /3:/ only in were when stressed 
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Oddities (none) 
2-phoneme (none) 
graphemes 

NOTE 


Gontijo et al. (2003) do not recognise /3:/ as a pronunciation of <ere>; 
presumably the version of RP they were using has were pronounced /wea/ 
and/or they analysed all its occurrences as unstressed /wa/. Because of this 
it was not possible to calculate percentages for <ere>. 


10.21 <ew> 


THE MAIN SYSTEM 


For both categories see Notes. 


Basic phoneme /ux/ 15% e.g. crew, shrewd, strewn, 
view, yew 
Frequent /ju:/ 84% e.g. few, nephew, new, newt, 
2-phoneme steward 
sequence 
THE REST 
pronounced 
Exceptions to main <ew> /au/ 1% only in sew, sewn, Shrewsbury 
system plus shew(ed), shewn (archaic 


spellings of show(ed), shown) 


Oddities (none) 

Other 2-phoneme <ewe> as 2-phoneme only in ewe, Ewell, Ewelme 
grapheme sequence /jur/ 

NOTES 


<ew> pronounced /ju:/ occurs medially only in newel, Newton, pewter, 
steward; otherwise, only where there is no futher vowel letter and only in 
(closed) hewn, lewd, mews, newt, thews; (open) clerihew, curfew, curlew, 
few, hew, knew, mew, mildew, nephew, new, pew, phew, sinew, skew, smew, 
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spew, stew; also dewif pronounced /dju:/ rather than /dgu:/. Except in these 


words and the few Oddities <ew> is always pronounced /u:/ - the high 


frequency of few, knew, new is presumably responsible for the few words 


with /jur/ having a much higher percentage of correspondences than those 


with /u:/. There seem to be no cases where <e, w> are separate graphemes. 


N.B. For vocalic graphemes beginning with (‘silent’) <h> see section 9.17. 


10.22 <i> 


N.B. <ie, i.e, igh, ir> have separate entries. 


THE MAIN SYSTEM 


For all these categories and the absence of percentages see Notes. 


Basic phoneme _ _/1/ 


Other phonemes /i:/ 


/at/ 


/j/ 


THE REST 


Exceptions to main <i> 
system 


regular in initial position, e.g. in, is, it, and in 
medial position before a consonant letter (except 
where <e>-deletion has occurred), e.g. his, 

live (verb), sit, this, with. See section 11.3 fora 
teaching rule relevant to ..VC monosyllables 


regular word-finally, e.g. kiwi, safari, spaghetti; 
frequent medially (with /j/-glide), e.g. ambience, 
alien, hernia, medial(ly) 


regular medially where <e>-deletion has 
occurred, e.g. writing, and (with /j/-glide) where 
<i> is the first vowel letter in the word and is 
followed by another vowel letter, e.g. bias 


only medially before a vowel letter, e.g. adieu, 
behaviour, lieu, purlieu, saviour, union, (inter) 
view 


pronounced 


/ze/ only in absinthe, impasse, ingenu(e), 
lingerie pronounced /'lengari:/ (also 
pronounced /'londgarer/), pince-nez, 
timbale, timbre 
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<i> /o/ only in lingerie pronounced /'londgere1/ 
(also pronounced /‘lengzari:/) 


<i> /2/ in a large set of adjectives/adverbs ending 
in <-ibl-e/y> pronounced /-abal, -abli:/, 
e.g. possibl-e/y, all of which can also be 
pronounced with /1/. Also in a few adverbs 
ending <-arily> when not stressed on 
the <a>, which becomes elided (see 
section 6.10), so that the <i> in <-ily> 
is pronounced /2/, e.g. necessarily, 
voluntarily pronounced /'nesasrali:, 
‘volantrali:/ (also pronounced /nesa'sertli:, 
volan'tertli:/ with <i> pronounced /1/ 
and the preceding <a> stressed and 
pronounced /e/). Otherwise perhaps only in 
Missouri (second <i>) 


Oddities <ia> /t/ only in carriage, marriage 


<ia> /a/ only in fuchsia, miniature, parliament, 
pharmacopoeia. \n words like crucial, initial 
| count the <i> as part of a digraph with the 
preceding consonant letter - see <ci, ti> in 
sections 9.10 and 9.36 


<ia> /at/ only in diamond 


<io> /a/ only in cushion, fashion, marchioness, 
stanchion. |In words like nation, lesion, 
vision, lotion, fusion | count the <i> as part 
of a digraph with the preceding consonant 
letter - see <si, ti> in sections 9.31 and 
9.36. In all other cases <i, o> are separate 
single-letter graphemes - see many 
examples in the Notes 


<is> /at/ only in island, isle(t), lisle, viscount 


<is> /ix/ only in chassis, commis (chef), coulis, debris, 
precis, verdigris pronounced /'v3:dr1gri:/ 
(also pronounced /'v3:digri:s/), vis-a-vis 
(last <is>) 


<it> /ix/ only in esprit, petit mal, wagon-lit 


2-phoneme (none) 
graphemes 
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NOTES 


Gontijo et al. (2003) analyse a great many occurrences of medial <i> before 
another vowel letter as being pronounced /1/, whereas | analyse them as 
being pronounced /i:/ + /j/-glide. Re-allocation proved impossible, hence 
the absence of percentages. 

Except in the cases noted in the Oddities, in <ia, io, is, it> the <i> is the 
whole or part of a separate grapheme. In particular, for <i, a> see below. 

For instances of <i> as an elided vowel see section 6.10. 

The regular pronunciations of <i> as a single-letter grapheme are 
complicated, and best set out in a flowchart - see Figure 10.1 and the 
following numbered paragraphs keyed to it. 


FIGURE 10.1: FLOWCHART TO DETERMINE THE REGULAR PRONUNCIATIONS 
OF <i> AS A SINGLE-LETTER GRAPHEME. 


<i> 
“ NY N 
“ NY N 
“ NV N 
word-initially (1) NV word-finally (7) 
NY medially NY 
/1/ “z N /ix/ 
“ N 
before a vowel letter before a consonant letter 
NY 
NY NV NY NY 
if <i> is 1st vowel NV with <e>- NY 
letter in word (2) otherwise deletion (5) otherwise (6) 
“ N NY NY 
/at/ v N /at/ /t/ 
Kv N 
pronounced as pronounced as 
a consonant (3) a vowel (4) 
/j/ /ix/ 


So the regular pronunciations of <i> as a single-letter grapheme are: 

1) In initial position: /1/, e.g. iguana, ill, incognito, Indian, indigo, inn, 
innocent, irritate, is, it. Exceptions, almost all with /at/: iambic, Iberian, 
ibex, ibis, ichor, icicle, icon, idea, identical, identity, ideology, idle, 
idol, iodine, ion, lonic, iota, irate, iris, lrish, iron-y/ic, isinglass, isobar, 
isogloss, isosceles and other compounds of (Greek) iso- (‘equal’), isolate 
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(from Italian isola from Latin insula ‘island’), item, itinerary, ivory, ivy. 
Only other exceptions: impasse, ingenu(e), with /x/ 

2) Medially where <i> is the first vowel letter in the word and is followed 
by another vowel letter: /az/ (plus /j/-glide) in a large set of words, 
e.g. bias, biology and several other compounds beginning <bio->, 
briar, client, diabolic and several other compounds beginning <dia->, 
friable, friar(y), giant, hiatus, liable, liar, lion, phial, pioneer, pliant, 
pliers, riot, sciatica, science, striation, triad, trial, triumph, viaduct, vial, 
violin, etc. Exceptions (all with /i:/ plus /j/-glide): clientele, fiancée), 
fiasco, fiord, kiosk, liais-e/on, liana, miasma, pianist, piano (/pi:'jenau/, 
with 3 syllables; in rapid speech also pronounced /'pjznau/ with <i> 
pronounced as consonant /j/ and 2 syllables - cf. category (3) below), 
piastre, trio, viola 

3-4) Medially where <i> is followed by another vowel letter but is not the 
first vowel letter in the word, it can be pronounced as a consonant or 
a vowel: 

3) The consonantal pronunciation of <i> as /j/ occurs only medially before 
a vowel letter or digraph mostly pronounced /a/ and almost always after 
the vowel bearing main stress: 

in two groups of words: a group ending <-iary>: apiary, auxiliary, 
aviary, breviary, domiciliary, incendiary, intermediary, pecuniary, 
stipendiary, subsidiary, topiary (no exceptions, but this is a small 
set), and a group ending <-ion>: battalion, billion, bunion, champion, 
companion, dominion, million, minion, onion, opinion, pavilion, pinion, 
union (lots of exceptions - see category 4 below); 

otherwise only in: behaviour, brilliancy, colliery, junior, saviour, senior, 
spaniel, plus (before a full vowel) milieu and, in rapid speech, brilliant, 
envious before /a/ and (before a full vowel and, exceptionally, with the 
stress on the vowel after the <i>) pronunciation. |In words like brilliant, 
envious, million, pronunciation (and cf. piano above), there is overlap 
with the next category because such words can be pronounced with 
consonant /j/ or vowel /i:/ plus /j/-glide, e.g. million as /'muiljan/ 
(2 syllables) or /'milizjan/ (3 syllables). Acoustically, the difference is 
very slight 

4) The regular vocalic pronunciation of <i> as a single-letter grapheme 
in medial position (but not as the first vowel letter in the word - see (2) 
above) when followed by a vowel letter is /iz/ plus /j/-glide, e.g. 

before <a, e, 0, Ou, U> pronounced /a/ (Carney would place these 
words under /1a/): ammonia, anaemia, bacteria, begonia, camellia, 
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chlamydia, (en)cyclopaedia, hernia, hysteria, media, myopia, salvia, 
sepia, utopia; amiable, dutiable, enviable, variable; myriad; aerial, 
congenial, jovial, managerial, material, memorial, radial, remedial, 
serial and about 450 others ending in <-ial>; barbarian, comedian, 
grammarian, guardian, pedestrian, ruffian, thespian and about 200 
others ending in <-ian>; dalliance, luxuriance, radiance, variance; 
radiant, suppliant, variant, alias; alien, audience, convenience, 
ebullience, expedience, experience, obedience, prurience, salience; 
expediency, leniency, convenient, ebullient, expedient, lenient, 
obedient, orient (/'dtritjant/ noun), pinochle, prescient, prurient, 
salient, sentient, subservient, transient; soviet, twentieth, etc.; period, 
sociological, axiom; accordion, bastion, battalion, billion, bullion, 
carrion, centurion, clarion, collodion, criterion, ganglion, medallion, 
mullion, oblivion, scorpion, scullion, stallion (this group with <-ion> 
are rarely if ever pronounced with /j/, unlike similar words listed in (3) 
above); chariot, patriot, commodious, compendious, curious, dubious, 
felonious, glorious, melodious, obvious, odious, previous, scabious, 
serious, studious, tedious and about 100 others ending in <-ious>; 
atrium, bacterium, barium, compendium, gymnasium, medium, opium, 
potassium, radium, stadium, tedium and about 200 others ending 
in <-ium>; genius, radius, also second <i> in amphibious, bilious, 
billiards, brilliant, criteria, delirium, editorial, fastidious, hilarious, 
historian, histrionic, idiom, idiot, industrial, juvenilia, memorabilia, 
millennia, omniscience, omniscient, perfidious, perihelion, reptilian, 
resilience, resilient, trivia(), vitriol, third <i> in incipient, initiate 
(noun), insidious, insignia, invidious, militaria; 

before <a, ae, a.e, ai, ar, e, O> pronounced as full vowel phonemes: 
abbreviate, ap/de-preciate, associate, audio, calumniate, caviar, 
foliage, luxuriate, mediaeval, milliamp, negotiate, orient (/prix'jent/, 
verb), oubliette, patio, polio, radio, ratio, serviette, studio, verbiage; 
also first <i> in conscientious, orgiastic, partiality, psychiatric, 
speciality, second <i> in affiliate, bibliography, histrionic, inebriation, 
insomniac, officiate, superficiality, vitriolic, third <i> in initiate 
(verb). In almost all these words (the only exceptions among those 
listed are sociological, medi(a)eval, orient (verb), oubliette, serviette, 
bibliography, histrionic, inebriation, superficiality, vitriolic) the main 
stress falls on the vowel before the relevant <i>. The consonant letter 
before the relevant <i> is hardly ever <c, s, sc, t> because <ci, si, 
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sci, ti> are almost always digraphs pronounced /Jf/ or /3/ (so the 
<i> is not pronounced separately) - see these graphemes’ entries in 
chapter 9, and see also category (6) below - but in a few words the 
<i> is pronounced separately as /i:/ plus /j/-glide; examples among 
the words listed are ap/de-preciate, associate, negotiate, patio, ratio, 
conscientious, partiality, speciality, initiate 
Exceptions with <i> not pronounced /i:/ (allwith stressed <i> pronounced 
/at/ plus /j/-glide): alliance, certifiable, defiant, denial, elegiac, 
leviathan, verifiable, anxiety, dubiety, notoriety, (im)piety, (im) propriety, 
sobriety, society, variety 

5-6) Medially where <i> is followed by a consonant letter: 

5) Itis pronounced /ar/ in thousands of words where the final <e> of <i.e> 
has been deleted before a suffix beginning with a vowel letter - see 
sections 6.3 and especially 6.4, e.g. bridal, cited, primal, riding, spinal, 
tribal, writing. See also most exceptions to next category 

6) Otherwise, mainly /1/, e.g. blink, divide (first <i>), piffle. This is 
especially true: 

before geminate and doubled consonant spellings, e.g. pick, pickle, 
biddie, bridge, midget, difficult, higgledy-piggledy, pillow, cinnamon, 
tipple, mirror, kiss, missal, hitch, pitcher, little, skittle, skivvy, drizzle, 
fizz. Extensions: all the words ending <-ville> and a few other words, 
e.g. big, brink, province, wind ‘stiff breeze’ (but see the group with 
/at/ and those spelt <-ibl-e/y> below, plus other exceptions within 
the lists below) 

in the endings <-ic(al), -ify>, e.g. critic(al, parasitic, beautify 

before a single consonant letter follwed by the endings <-ic(al)>, 
e.g. critical, parasitic. In all such words except impolitic, impoliticly 
(injudiciously’), politic(s), politicly (‘judiciously’), the stress falls on 
the relevant <i>, but political follows the rule (more on this in the last 
paragraph of these Notes) 

before the ending <-ly> in adverbs formed from adjectives in <-y>, 
e.g. happily. Note that addition of the suffix changes the stem-final 
vowel from /i:/ (in my analysis) to /1/ 

before the endings <-cial, -cian, -cious, -ssion, -tion, -tious>, e.g. 
beneficial, official, electrician, magician, auspicious, delicious, fission, 
mission, coition, fruition, fictitious, propitious, plus initial, provincial, 
siliceous, suspicion. \n all these words the stress falls on the <i> before 
the ending. 
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Exceptions: 

with /e/: absinthe, lingerie pronounced /'lanzeari:/ (also pronounced 
/‘londgeret/), meringue, pince-nez, timbale, timbre 
with /a/: a large set of adjectives/adverbs ending in <-ibl-e/y> 
pronounced /abal, ablix/, e.g. possibl-e/y, all of which can also be 
pronounced with /1/. Also in a few adverbs ending <-arily> when not 
stressed on the syllable spelt with <a>, which becomes elided (see section 
6.10), so that the <i> in <-ily> is pronounced /a/, e.g. necessarily, 
voluntarily pronounced /'nesasraliz, ‘volantraliz/ (also pronounced 
/nesa'sertliz, volan'tertli:/ with <i> pronounced /1/ and stress on the 
preceding syllable spelt with <a> which is pronounced /e/) 
with /i:/: albino, ambergris, amino, ballerina, batik, casino, chic, cliché, 
concertina, diva, farina, frisson, gilet, kilo, lido, litre, maraschino, 
marina, massif, merino, modiste, mosquito, motif, ocarina, piquan-t/ 
cy, scarlatina, semolina, visa, first <i> in graffiti, kiwi, martini, 
migraine, milieu, second <i> in aperitif, bikini, incognito, libido 
with /at/ in a number of words before a single consonant letter and 
word-final <a, o>, e.g. angina, giro, impetigo, lino, mica, proviso, 
rhino, saliva, silo, vagina, viva (voce) (‘oral exam’); otherwise only in 
mic /matk/. In all these words the syllable spelt with <i> is stressed 
with /at/ in a number of words where <i> is the only or last vowel 
letter and is followed by more than one consonant letter: child, Christ, 
indict, mild, ninth, paradigm, pint, whilst, wild and the <-ign, -ind> 
groups: align, assign, benign, consign, design, malign, resign, sign 
(sub-exception: ensign, with /a/); behind, bind, blind, find, grind, hind, 
kind, (re)mind, rind, wind pronounced /watind/ (‘turn’; contrast wind 
pronounced /wind/ ‘stiff breeze’). The <-ind> pronounced /aind/ 
group is one of only five cases where the pronunciation of a phonogram/ 
rime is more predictable as a unit than from the correspondences of the 
separate graphemes, and there are enough instances to make the rule 
worth teaching; see section A.7 in Appendix A 
with /at/ in an unpredictable ragbag of other words, e.g. binary, 
bison, finance, final, first <i> in finite (but none of its derivatives), 
library, licence, license, micron, migrant, minus, paradigmatic, 
piracy pronounced /'patrasi:/ (also pronounced /'ptirasi:/), primacy, 
primary, primate, primus, rival, silent, sinus, siphon, sisal, strident, 
tiger, trident, vibrant, vital 

7) The regular pronunciation of <i> as a single-letter grapheme in final 

position in words with at least one earlier vowel letter is /i:/, e.g. anti, 
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bikini, graffiti, khaki, kiwi, muesli, spaghetti, svengali, wiki. Exceptions 

(all with /at/): alibi, alkali, (anno) domini, (a) fortiori/ posteriori/priori, 

(lapis) lazuli, quasi, rabbi and some Latin plurals, e.g. alumni, bacilli, 

cacti, foci, fundi (/'fandat/, plural of fundus ‘inner corner of organ’; 

contrast fundi pronounced /'fundi:/, either South and East African 

English for ‘expert/skilled person’, or a member of the fundamentalist, 

uncompromising wing of the German Green Party), fungi, gladioli, 

and lots of Latin biological terms with anglicised pronunciations, e.g. 

leylandii, plus Greek bronchi, chi, phi, pi, psi, xi. 

There appear to be only nine words with <i> as the only vowel letter, and in 
word-final position; most have /ar/, namely the greeting Hi!, the pronoun 
|, and the Greek letter names (as pronounced in English) chi, phi, pi, psi, 
xi, but even this tiny set has two exceptions with /i:/: the musical term mi, 
and ski. 

Almost all words ending /1k(al/s) spelt <-ic(al/s)> have stress on the 
preceding syllable. Exceptions: Arabic, arithmetic (noun), drsenic (noun, if 
pronounced /‘a:sanik/ with three syllables), biopic (pronounced /'batjaupik/ 
by those who recognise its origin as an abbreviation of ‘biographical picture’, 
= film), catholic (if pronounced /'kze@alik/, with three syllables), cérvical 
/'s3rvikal/ (as in cérvical vertebrae, in the neck - but see below), chdleric, 
climdcteric, héretic, impolitic(ly), lunatic, politic(ly/s), rhétoric, turmeric - 
but arithmétic (adjective), arithmétical, arsénic (/a:'senik/, adjective), 
herétical, political, rhetorical follow the rule; so does bidpic (pronounced 
/bat'jopik/ (rhymes with myopic) by those who apply the general ‘stress the 
syllable before <ic>’ rule, thus proving its psychological reality). 

Arsenic (noun) and catholic pronounced with three syllables are 
exceptions, but both more often have the central written vowel elided (see 
section 6.10) and are pronounced /'‘arsnik, 'k#6l1k /, with two syllables. 
Phonologically, this makes them regular - they are stressed on the syllable 
preceding /1k/ spelt <ic>. However, in terms of predicting word stress from 
written forms, they are still exceptions - they are stressed on the syllable 
containing the second vowel letter before the <ic> instead of the first. 

Other words with two pronunciations, but differing in stress, are (fly) 
agaric /a'gerik/ (regular) or /‘egartk/ (exception), chivalric /f1'velrtk/ 
(regular) or /‘fivalrtk/ (exception); on chivalricthe Oxford English Dictionary 
says ‘The first pronunciation is that sanctioned by the poets’. Extensions: 
Greek plurals such as erdtica; the modern coinage emoticon. Also note 
the modern contrast in meaning between cervical /'s3:vikal/ in cérvical 
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vertebrae (in the neck) and /s3:'vatkal/ in cervical cancer/smear (in the 


cervix/entrance to the womb). 


The vowel preceding <ic> always has a ‘short’ pronunciation (except in 


aphasic with /e1/, acetic, emic, graphemic, phonemic, scenic with /i:/, and 


biopic pronounced /'bataupik/, chromic, phobic and all its compounds, with 


/au/), as does the <i> in <ic>, except in cervical pronounced /s3:'vatkal/. 


10.23 <ie> 


N.B. <i.e> has a separate entry. On the percentages see Notes. 


THE MAIN SYSTEM 


Basic /ix/ 73% 
phoneme 
THE REST 

pronounced 


Exceptions to <ie> /at/ 
main system 


<ie> /e/ 
<ie> /1/ 
Oddities <ier>  /1a/ 


<ieu> = /ur/ 


2-phoneme (none) 
graphemes 


e.g. brief, diesel, achieve, calorie 


21% in a very small set of words in word-final 
position, namely die, fie, hie, lie, pie, tie, vie 


6% only in friend 


<1% only in (hand/nec) kerchief, mischief, 
mischievous, sieve 


never initial; only in (medially) fierce, 

pierce, tierce; (word-finally) bandolier, bier, 
bombardier, brigadier, cashier, cavalier, 
chandelier, chevalier, clavier, corsetier, frontier, 
fusilier, gondolier, grenadier, halberdier, pier, 
tier, vizier and a few other very rare words. 
<ier> is always stressed, except that frontier is 
pronounced either /'frantza/ or /fran’tra/. In all 
other words ending <ier> the <i> and the <er> 
are/belong to separate graphemes and belong 
to separate syllables - see Notes 


only in lieu pronounced /lu:/ (also pronounced 


/\juz/) 
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NOTES 


Even though Gontijo et al. (2003) analyse final <ie> in words where there is 
at least one earlier vowel letter as being pronounced /1/ it was possible to 
re-allocate all such words to /i:/ and recalculate the percentages. 
<i, e> are/belong to separate graphemes in anxiety, convenient, leniency, 
Science, twentieth and all other words with those endings, plus adieu, alien, 
client(ele), conscientious, diet, fiery, medieval, milieu, oubliette, quiet(us), 
serviette, spaniel, soviet, (inter/re-)view. All have an intervening /j/-glide 
except adieu, (inter/re-)view, spaniel, where the <i> spells /j/ after a 
preceding consonant anyway. 
<i, er> are, or belong to, separate graphemes in: 
all three-syllable comparative adjectives in <-ier> pronounced /i:ja/ 
formed from two-syllable adjectives ending in <-y>, e.g. easier, 
happier 
barrier, espalier with /isja/, colliery with /je/, dossier with /itja/ or 
/isjet/, drier, flier, pliers with /ata/ 
a few words in which the <i> always or sometimes forms a digraph 
with the preceding consonant letter: crosier, hosier, osier, brazier, 
crozier, glazier sometimes pronounced with /3a/ (alternatively with 
/ixja/); soldier with /d3a/. 


10.24 <i.e> 


Occurs only where <e> is word-final. 
See Notes for all categories and for how this split digraph is defined, and 
see section 11.4 for ateaching rule relevant to all split digraphs except <y.e>. 


THE MAIN SYSTEM 


Basic phoneme /at/ 97% e.g. bike, live (adjective), time 


Other phoneme /ix/ 3% only in about 88 mostly French 
loanwords, e.g. police, quiche 


THE REST 
Exceptions to main system strictly speaking, none, but see Notes 
Oddities (none) 


2-phoneme graphemes (none) 
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NOTES 


The split digraph <i.e> is defined as covering words where the <e> is 
separated from the <i> by one consonant letter other than <r> and the <i> 
is not preceded by a vowel letter and the digraph is pronounced either /at/ 
or /ix/. The definition covers both words where the intervening consonant 
letter is an independent grapheme and words where the <e> is also part 
of a split digraph <ce, ge, ve> - see sections 3.7.4, 3.7.6-7 and 3.8.4, and 
section 7.1 for dual-functioning. See also section A.6 in Appendix A. 

The familiar /at/ pronunciation occurs in many hundreds of words and 
does not need further illustration. The /i:/ pronunciation occurs only in 
about 88 (mostly French) loanwords; those which fit the main definition 
just given (for extensions see below) are: caprice, police; automobile, 
imbecile; centime, regime; beguine, benedictine (‘liqueur’), benzine, 
bombazine, brigantine, brilliantine, chlorine, citrine, cuisine, dentine, 
figurine, gabardine, guillotine, iodine, latrine, libertine, limousine, machine, 
magazine, margarine, marine, mezzanine, morphine, nectarine, nicotine, 
opaline, phosphine, plasticine, pristine, quarantine, quinine, ravine, routine, 
sardine, sistine, strychnine, submarine, tagine, tambourine, tangerine, 
terrine, tontine, trampoline, tyrosine, undine, vaccine, vitrine, wolverine; 
anise, cerise, chemise, expertise, valise; elite, marguerite, petite, suite; 
naive, recitative. 

Extensions: 

1) There are four words where <i.e> pronounced /at/ is separated by two 
consonant letters forming a digraph: blithe, lithe, tithe, writhe; 

2) There are 18 words where <i.e> pronounced /i:/ is separated by two 
letters forming a consonant digraph: fiche, niche pronounced /ni:/, 
pastiche, quiche; fatigue, intrigue; chenille; antique, boutique, clique, 
critique, mystique, oblique, physique, pique, technique, unique; pelisse; 

3) There are three words where <i.e> pronounced /i:/ is separated by <s, 
t> pronounced separately: artiste, dirigiste, modiste; 

4) There are two words where <i.e> pronounced /i:/ is separated by the 
three letters <squ> pronounced /sk/, with <qu> forming a consonant 
digraph: bisque, odalisque. 

Exceptions (all words with at least one earlier vowel letter, except give, live 

(verb)) where the <i> is a separate grapheme pronounced /1/ and the <e> 

forms a digraph with the intervening consonant letter: 

a set of words ending in <-ice> in which <-ce> is a digraph pronounced 
/s/: accomplice, apprentice, armistice, artifice, auspice, avarice, 
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benefice, bodice, caddice, chalice, cicatrice (but the plural cicatrices 
is pronounced /sika'trarsi:z/, cockatrice, coppice, cornice, cowardice, 
crevice, dentifrice, edifice, hospice, jaundice, justice, lattice, malice, 
notice, novice, office, orifice, poultice, practice, precipice, prejudice, 
pumice, service, solstice, surplice. All words in <-ice> with no earlier 
vowel letter are pronounced with /ats/, as are advice, device, sacrifice, 
suffice - and see above for caprice, police 
one word ending in <-ice> pronounced /1f/: liquorice (also pronounced 
with /s/) 
one word ending in <-ife> pronounced /1f/: housewife (‘sewing kit’), 
pronounced /‘hazif/ 
a set of words ending in <-ine> in which <-ne> is a digraph 
pronounced /n/: bowline, clandestine pronounced /klzen'destin/ (also 
pronounced /'kleandastatn/, in which case <i.e> is a split digraph), 
compline, crinoline, (pre)destine, determine, discipline, doctrine, 
engine, ermine, examine, famine, feminine, genuine, heroine, illumine, 
imagine, intestine, jasmine, marline, masculine, medicine, peregrine, 
saccharine, sanguine, urine, vaseline 
five words ending in <-ise> in which <-se> is a digraph pronounced 
/s/: mortise, practise, premise, promise, treatise 
several words in <-ite> in which <-te> is a digraph pronounced /t/: 
composite, definite, exquisite, favourite, granite, hypocrite, infinite, 
opposite, perquisite, plebiscite, requisite 
a large number of words ending in <-ive>, e.g. adjective, massive, all 
of which are pronounced with /1v/ except naive, recitative, which end 
in /ixv/ and therefore have the split digraph pronounced /i:/ and are 
so listed above; also give, live (verb) - most words in <-ive> with no 
earlier vowel letter have /atv/, e.g. chive, dive, five, jive, live (adjective), 
Shrive, strive, swive, thrive, wive. 
There are very few English words ending in <-ige>. The only two to which 
the regular pronunciation /atdg/ applies are (dis)oblige (both stressed on 
the <i> before <ge>). Otherwise there are only the two exceptions vestige, 
with unstressed /1d3/, and prestige, with stressed /i:3/. 
The only words in which a final <e> after <i>+consonant is pronounced 
separately appear to be anime (from Japanese), (bona) fide (Latin) and 
campanile (from Italian). 
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10.25 <igh> 


THE MAIN SYSTEM 


Only phoneme /at/ 100% e.g. sigh, sight. Always follows a 
consonant letter, and is therefore never 
word-initial 


NOTES 


In my analysis, there are no cases where <i, gh> are separate graphemes. 

Provided that analysis is accepted, this is one of the very few rules 
without exceptions in the whole system. However, as far as | can aScertain 
(even digging around for rare and archaic words), there seem to be just 26 
stem words in the entire language containing this grapheme: high, nigh, 
sigh, thigh; bight, blight, bright, fight, flight, fright, hight, knight, light, 
might, night, plight, right, sight, slight, tight, wight, wright; alight (in its 
‘descend from vehicle’ sense; in its ‘on fire’ sense it is derived from light (a 
fire)), delight, Blighty, sprightly - some of which are of very high frequency 
- plus many derivatives. Perhaps the shortage of such words is why the rule 
is 100% reliable. 

Clymer (1963/1996) cited two different supposed pronunciation rules 
that are relevant here: 

11. When the letter /is followed by the letters gh, the j usually stands for its 
long sound and the ghis silent. 

25. When ght is seen in a word, gh is silent. 

He said rule 25 has 100% ‘utility’ (= reliability) and rule 11 only 71%. 

Rule 25 really is 100% accurate in its own terms because it covers not 
only the 21 words listed above containing <ight> but also the only word 
containing <aight>: straight, and the only five words with <eight>: eight, 
freight, height, sleight, weight. However, the rule is unhelpful because (a) 
telling learners that some letters are ‘silent’ may be confusing (for more on 
that see section A.5 in Appendix A); (b) it seems to me much more logical 
to analyse the <gh> in all the relevant words as part of a vowel grapheme 
with the preceding vowel letter(s); (c) as it stands, the rule does not specify 
the pronunciation of the preceding vowel grapheme. 

Rule 11 is also unhelpful on grounds (a) and (b). Also, as several 
commentators have pointed out, it fails to reach 100% reliability only because 
it is underspecified. If formulated as ‘After a consonant letter, <igh> is 
always pronounced /at/’, it is 100% reliable and well worth teaching. The 
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restriction ‘after a consonant letter’ is to exclude the six words with <aight/ 


eight> listed in the previous paragraph, plus six with just <eigh>: heigh, 


inveigh, neigh, neighbour, sleigh, weigh. 


For more about Clymer’s rules see chapter 11. 


10.26 <ir> 


THE MAIN SYSTEM 
Basic phoneme _ /3:/ 100% 
THE REST 
pronounced 
Exceptions to main 
system 
<ir> = /19/ 
<ir> = /at/ 
<ir> as 2-phoneme 
sequence /ata/ 
Oddities <ire> as 2-phoneme 


sequence /ata/ 


e.g. fir 


<1% in total 


only in emir, fakir, nadir pronounced 
/‘netdia, ne'dia/ (also pronounced 
/'netda/), kir, kirsch, souvenir, tapir 


only in iron /‘atan/ 


only medially but always stressed and 
mainly where <-e> has been deleted 
from words in the following paragraph, 
e.g. aspiring, desirous, expiry, spiral, 
tiring, but there are a few independent 
examples, e.g. biro, giro, pirate, virus. In 
all cases the <r> is both part of <ir> and 
a grapheme in its own right pronounced 
/r/. For dual-functioning see section 

7.1. In deliri-ous/um, by contrast, <i, 

r> are separate graphemes, the <i> is 
pronounced /1/, and the <r> has only one 
function and is (of course) pronounced /r/ 


only word-finally and only in ac/in/ 
re-quire, admire, a/con/in/per/re/ 
tran-spire, attire, desire, dire, empire, 
entire, expire, fire, hire, (be/quag-)mire, 
quire, saltire, samphire, sapphire, satire, 
Shire, sire, spire, e)squire, tire, umpire, 
vampire, wire. Many of these words allow 
/r/-linking, e.g. aspiring, spiral - see 
previous paragraph and section 3.6 
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<irr> = /3:/ only in chirr, shirr, whirr and suffixed 
forms of verbs in <-ir>, e.g. stirring; 
otherwise <i, rr> are separate graphemes, 
e.g. in irrigate, irritant. \n (e.g.) stirring, 
whirring there is /r/-linking (see section 
3.5) and <rr> is both part of <irr> anda 
grapheme in its own right pronounced /r/. 
For dual-functioning see section 7.1 


Other 2-phoneme (none) 


graphemes 


N.B. For word-final <I, le, m, n> involved in 2-phoneme sequences with /a/ 
see sections 9.20-23. 


10.27 <o> 


N.B. <o.e, Oi, 00, Or, Ore, OU, OW, Oy> have separate entries. 


THE MAIN SYSTEM 


For all these categories see Notes. 


Basic phoneme /v/ 41% predominant in words with no 
other vowel letter, e.g. box, from, 
of, on, not, sock 


Other /ux/ 18% only in zoology (first <o>) and 
phonemes derivatives and 10 other stem 
words - see Notes; several are very 
frequent 
/au/ 16% e.g. go, lotion, most, ocean, roving. 


Regular where <e>-deletion has 

occurred, before some word-final 
consonant clusters, before some 

endings, word-finally, and in 


<-osis> 
/a/ 14% e.g. bishop, Briton, oblige, union 
[Al 9% only in a restricted set of words, 


e.g. above, come, done, monk 
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THE REST 
pronounced 
Exceptions to 2% in total 
main system 
<o> /1/ only in pigeon (taking <ge> as 
pronounced /d3/; compare pidgin), 
women 
<o> /0/ only in bosom (1 <o>), wol-f/ves, 
wolfram, wolverine, Wolverhampton (1°* 
<o>), woman 
<o> as 2-phoneme only in once, one 


sequence /wa/ 


Oddities <oa> /au/ only in (initially) oaf, oak, oast, oat, oath; 
(medially) approach, bloat, boast, boat, 
broach, cloak, coach, coal, coast, coat, 
coax, croak, encroach, float, foal, foam, 
gloaming, gloat, goad, goal, goat, groan, 
groat, hoax, loach, load, loaf, loam, 
loan, loath, loathe, moan, moat, poach, 
reproach, roach, road, roam, roan, roast, 
shoal, soak, soap, stoat, throat, toad, 
toast, woad; (finally) cocoa, whoa 


<oa> /o:/ only in abroad, broaden) 


<oar> /o:/ only in boar, board, coarse, hoar, hoard, 
hoarse, oar, roar, soar 


<oar> /2/ only in cupboard, larboard, starboard 


<oat> /au/ only in boatswain pronounced /'bausan/ 
(also pronounced /'bautswetn/) 


<oe> /ix/ only in amenorrhoea, amoeba, 
apnoea, coelacanth, coelenterate, 
coeliac, coelom, coenobite, coenocyte, 
diarrhoea, dyspnoea, foetal, foetid, 
foetus, gonorrhoea, logorrhoea, oedema, 
oenology, oesophagus, oestrogen, oestrus, 
pharmacopoeia, phoenix, pyorrhoea, 
subpoena. Many of these words have 
alternative spellings in <e>, especially in 
US spelling 
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<oe> 


<oe> 
<oe> 
<oer> 
<oeu> 
<oh> 


<ol> 


<olo> 
<os> 


<ot> 


Other 2-phoneme (none) 
graphemes 


NOTES 


/au/ 


/ur/ 
/A/ 

/dt, va/ 
/ur/ 
/au/ 


/au/ 


/31/ 
/avu/ 


/au/ 


except in throes, only word-final and 
only in aloe, doe, floe, foe, hoe, oboe, roe, 
schmoe, sloe, toe, woe 


only in canoe, hoopoe, shoe 

only in does(n’t) 

only in Boer pronounced /b93:, bua/ 
only in manoeuvre 

only in doh, kohl, Oh, ohm, soh 


only in folk, Holborn, holm, yolk and old- 
fashioned pronunciation of golf as /gauf/ 


only in colonel 
only in apropos 


only in argot, depot, entrepot, haricot, 
jabot, matelot, potpourri, sabot, tarot, 
tricot. /t/ surfaces in sabotage, saboteur 
- see section 7.2 


<0, a> (with intervening /w/-glide) belong to separate graphemes in coagulate, 


coalesce, coalition, coaxial, Croatia, hypoallergenic, oasis, protozoa, etc. For 


cases where <o, e> belong to separate graphemes see coerce, etc., below. 


<ol, olo, os, ot> are single graphemes only in the Oddities listed. 


For instances of <o> as an elided vowel see section 6.10. 


The default pronunciation of <o> as a single-letter grapheme is /p/, but 


here are some categories for guidance: 


regular in words with no other vowel letter, e.g. bob, boll (also 


pronounced with /au/), box, cod, crotch, dog, doll, from, knoll, lock, 


long, loll, loss, moll, odd, of, off, on, plonk, poll (‘parrot’), troll, shop, 


yon. Extensions: begone, gone. Exceptions: boll (sometimes), droll, 


poll (‘head, vote’), roll, scroll, stroll, toll with /au/, wolf with /u/. See 


section 11.3 for a teaching rule relevant to ..VC monosyllables 


in a few words where <o> is the last vowel letter, e.g. alcohol, belong, 


compost, methanol, micron, parasol, phenol, protocol 
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regular before geminate and doubled consonant spellings (in addition 
to relevant words in the previous category), e.g. bobbin, cockle, locket, 
coddle, codger, lodge, coffee, toggle, atoll, dollop, holly, jolly, lolly, 
polly, topple, lorry, across, blossom, crotchet, bottle, s(dhemozzle, 
first <o> in follow, connotation. Extensions: garrotte, gavotte 
mostly before consonant clusters (in addition to relevant words in 
previous caregories), e.g. confident, costume, doldrums, donkey, 
obstinate, ostensible, posterior, tonsils, but there are quite a few 
exceptions - see later categories 
before a single consonant letter followed by the endings <-ic(al)>, 
e.g. atomic, boric, carbolic, chaotic, exotic, frolic, harmonic, 
logical, phonic(s), tonic, topic(al). This includes all the words 
ending <-ological>, e.g. biological, sociological. Exceptions: biopic 
pronounced /'batjaupik/, chromic, phobic and all its compounds, with 
/9u/ 
in final <-ogue>, e.g. analogue, catalogue, dialogue, plus baroque 
as the first <o> in the suffix <-ology> pronounced /'vladgi:/, e.g. 
biology, chronology, etc. 
in a few other non-final occurrences, e.g. admonish, bother, demolish, 
grovel, homage, hovel, hover, moderate, modest, moral, novel, novice, 
olive, polish, poverty, project (noun), proper, provenance, proverb, 
robin, scholar, sovereign, first <o> in gondola, provocation. 

The task then is to try to define when <o> has other pronunciations. 

<o> is pronounced /wa/ only in once, one. 

No rules can be given for when <o> is pronounced /a/, except that in 
stem words it never occurs word-finally, and initially it occurs only in onion, 
other, oven, so here is a list of its medial occurrences: above, accomplice, 
accomplish, amok, become, borough, brother, Cadogan, colour, colander 
(also pronounced with /v0/), Colombia (seond <o>), come, comfort(able), 
comfrey, comfy, company, (en)compass, conjure (‘do magic tricks’), 
constable, coven, covenant, (dis/re/un-)cover, covert pronounced /'kav3:t/ 
(also pronounced /'kuav3:t/), covet(ous), covey, coz, cozen, done, dost, 
doth, dove, dozen, dromedary, front, frontier, glove, govern, honey, London 
(first <o>), lovage, love, Lovell, Monday, monetary, money, monger and its 
compounds, mongrel, monk, monkey, Monroe,Montgomery (twice), month, 
mother, none, nothing, plover, shove, shovel, slovenly, smother, sojourn 
(also pronounced with /p/), some, somersault, son, sponge, thorough, ton, 
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tonne, tongue, twopence, twopenny, windhover, won, wonder, worrit, worry. 
Some words which used to have /a/ in RP now have /b/ instead, e.g. combat, 
comrade, conduit, Coventry. 

Similarly, no rules can be given for when <o> is pronounced /u:/, 
but it occurs only for the first <o> of zoology and derivatives with initial 
<zoo-> (Greek, ‘living thing’) spelling two syllables pronounced /zu:'wo/ 
if the second syllable is stressed, otherwise /zu:wa/, and 10 other stem 
words: caisson pronounced /ka'su:n/, canton (‘provide accommodation’, 
pronounced /kzn'turn/), catacomb, do, lasso, to, tomb, two, who, womb, 
plus derivatives including cantonment, lassoing, whom, and a few from 
words in which <o.e> is a split digraph pronounced /u:/, e.g. approval, 
movie, removal, and the proper nouns Aloysius /xlu:'wifas/, Romania, 
Wrotham /'ruxtam/. 

<o> is pronounced /au/: 

in hundreds of words where final <e> has been deleted, e.g. dosage, 
dotage, global, modal, polar, rosy, roving, tonal 

regularly in word-final position, e.g. albino, amino, audio, calico, casino, 
fiasco, fro, gecko, giro, go, incognito, indigo, impetigo, kilo, libido, lido, 
lino, kimono, manifesto, maraschino, merino, no, patio, piano, piccolo, 
polio, portico, potato, proviso, radio, ratio, rhino, scherzo, silo, studio, 
trio, tobacco, tomato, tremolo, video (for exceptions with /u:/ see 
above) 

often before a consonant cluster, e.g. behold, bold, cold, cuckold, (blind/ 
mani-) fold, gold, hold, marigold, old, scaffold, scold, sold, threshold, 
told, wold; bolt, colt, dolt, jolt, revolt, volt; don’t, wont, won’t; almost, 
ghost, host, most, post; solder, soldier, bolster, holster; molten. Word- 
final <-old> pronounced /auld/ group is one of only five cases where 
the pronunciation of a phonogram/rime is more predictable as a unit 
than from the correspondences of the separate graphemes, and there 
are enough instances to make the rule worth teaching; see section 
A.7 in Appendix A. Exceptions: belong, font, cost, frost, lost and most 
words where <o> is not the last vowel letter, e.g. costume, foster, 
hostage, hostile, all with /o/, scaffolding with /a/, front and others 
listed above with /a/, catacomb, tomb, womb with /u:/ 

in eight words before final <-Il>: boll (also pronounced with /v/), 
droll, plimsoll, poll (head, vote’), roll, scroll, stroll, toll (contrast atoll, 
doll, knoll, loll, poll (‘parrot’), troll, all with /o/), and in four words 
before final <-l>: control, enrol, extol, patrol (in these four words the 
syllable spelt with the relevant <o> is stressed) 
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in a few other words with no other vowel letter: both, comb, gross, loth, 
quoth, sloth, troth 
in all the words in <-osis>, e.g. diagnosis, neurosis 
before a consonant letter other than <r> and word-final <a, o>, e.g. 
aroma, diploma, iota, kimono, sofa (in all these words the syllable 
spelt with the relevant <o> is stressed) 
(with intervening /w/-glide) in a few words before <e>: coeducational, 
coerce, coexist, hydroelectric, phloem, poem, poetic - but most 
examples of <oe> constitute a single grapheme; see the Oddities 
before endings <-ia(ge/I/n), -ion, -ious, -ium>: ammonia, apologia, 
begonia, magnolia, foliage, ceremonial, colonial, social, custodian, 
corrosion, erosion, ex/im-plosion, devotion, lotion, (com/e/loco/ 
pro-)motion, notion, potion; acrimonious, atrocious, ceremonious, 
copious, euphonious, felonious, ferocious, harmonious, parsimonious, 
precocious, sanctimonious, chromium, opium, pandemonium, sodium, 
symposium (in all these words the syllable spelt with the relevant <o> 
is stressed) 
in a ragbag of other words, e.g. bogus, bohemian, bonus, bosun, 
brochure, bromide, cobra, cocoa, codeine, cogent, cohort, colon, 
crocus, focal, focus, grotesque, local, locus, lotus, molar, moment, (e) 
motive, nomad, notary, oval, potent, proton, robust, rodeo, rodent, 
romance, rosary, rotary, rotund, slogan, solar, sonar, total, betroth, 
vocal, votary, votive, yodel, yokel. 
/a/ is the regular pronunciation of unstressed <o> in initial and medial 
positions. Word-initially, however, the pronunciation of <o> as /a/ occurs 
only in the Latin prefix <ob-> and its derivatives, e.g. in oblige, obscene, 
obscure, observe, obsess, obtain, occasion, occur, offend, official. Medially, 
<o> is pronounced /9a/ in: 
the prefixes <con- (and related forms), pro-, to-> pronounced 
/kan (etc.), pra, ta/, e.g. collect, collide, command, commit(tee), 
confess, connect, connive, connubial, consent, continue, contingency, 
contrast (verb, pronounced /kan'trarst/), corrode, corrupt, procure, 
produce, profane, profess(or), prolong; today, together, tomorrow 
the end of the word-elements <bio-, chloro-, micro-, mono-, phono-, 
photo-, saxo-> when unstressed 
the very large set of words with word-final <-ion>, e.g. coercion, 
vision, mission, nation, accordion, aphelion, bastion, battalion, billion, 
bullion, carrion, centurion, champion, clarion, collodion, companion, 
criterion, dominion, ganglion, ion, lion, medallion, million, mullion, 
minion, oblivion, onion, opinion, pavilion, perihelion, pinion, rebellion, 
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scorpion, scullion, stallion, union and even anion, ion, cation (no 
exceptions) 

the (much smaller) set of words with word-final <-eon>, namely 
bludgeon, chameleon, curmudgeon, dudgeon, dungeon, galleon, 
gudgeon, melodeon, Odeon, smidgeon, sturgeon, surgeon, widgeon. 
Only exception: pigeon, with /1/ 

another small set before word-final <m, n>: axiom, bosom, bottom, 
custom; Briton, button, carton, cotton, iron, matron, pardon, siphon/ 
syphon, summon, wanton. Exception: icon, with /p/ 

a further small set where it occurs between a vowel letter and a single 
word-final consonant letter, e.g. chariot, halcyon, idiot, idol, patriot, 
period, vitriol 

the noun-forming ending <-dom> pronounced /dam/, e.g. kingdom, 
wisdom 

the adjectival ending <-some> pronounced /sam/, e.g. handsome, 
and a few other words with the same-sounding ending; besom, 
blossom, buxom, hansom, lissom, ransom, transom 

the noun endings <-ock, -od, -op> pronounced /ak, ad, ap/, e.g. 
bollock, bullock, buttock, hassock, hillock, mattock, pillock, rowlock; 
method, synod; bishop, gallop, wallop 

the second <o> in the suffix <-ology> pronounced /'pladgi:/, e.g. 
biology, chronology, etc. 

the first <o> in the suffix <-ological> pronounced /a'lodgikal/, e.g. 
biological, sociological, etc. 

a ragbag of words including abdomen, acrobat, agony, almoner, 
amphora, anemone, aphrodisiac, automobile (twice), carol, cellophane, 
cenotaph, cupola, custody, daffodil, ebony, espionage, exodus, 
geographic, iodine, irony, isobar, isogloss, isolate, ivory, kaolin, lobelia, 
mandolin, mimeograph, mutton, parabola, parody, pergola, petrol, 
piston, plethora, police, purpose, ricochet, second, sobriety, society, 
theocratic, violate, violin, first <o> in bolero (/ba‘learau/, ‘dance’), 
creosote, piccolo, proprietor, stereophonic, tobacco, tremolo; second 
<o> in broccoli, choreographic, colloquy, gondola, obloquy, rollocking. 


10.28 <o.e> 


Occurs only where the <e> is word-final. 
See Notes for all categories and for how this split digraph is defined, and 
see section 11.4 for a teaching rule relevant to all split digraphs except <y.e>. 
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THE MAIN SYSTEM 


Basic phoneme /av/ 100% e.g. bone, chromosome, remote, cologne 


Other phoneme /u:/ <1% _— only in combe, lose, move, prove, whose 
/kurm, lurz, murv, prurv, hurz/ and 
gamboge pronounced /gzem'bu:3/, plus 
the derived forms ap/dis/im/re-prove, 


remove 
THE REST 

Exceptions to main system strictly speaking, none, but see Notes 
Oddities (none) 

2-phoneme graphemes (none) 

NOTES 


The split digraph <o.e> is defined as covering words where the <e> is 
separated from the <o> by one consonant letter other than <r, w> and 
the <o> is not preceded by a vowel letter and the digraph is pronounced 
/av, ur/. The definition covers both words where the intervening consonant 
letter is an independent grapheme and words where the <e> is also part of 
a digraph <ce, ge (but see below), ve> - see sections 3.7.4, 3.7.6 and 3.8.4, 
and section 7.1 for dual-functioning. 

The only extension needed is to cover combe, with two intervening letters 
forming a consonant digraph. 

However, there are several words with <o, e> separated by a consonant 
letter(s) where the <o> is a separate grapheme and the <e> forms a di/ 
trigraph with the consonant letter(s): barcarole, compote, cote, (be)gone, 
scone, shone with <o> pronounced /bp/, above, become, come, done, dove, 
glove, love, none, shove, some, tonne with /A/, purpose, welcome and all the 
adjectives ending <-some> with /a/. See also section A.6 in Appendix A. 

There are very few English words ending <-oge>: Doge (‘former chief 
magistrate of Venice’), which seems to be the only one in which the regular 
pronunciation of <o.e> as /au/ always applies; gamboge pronounced 
/gem'bau3, gem'bu:3/; and a few even more obscure words derived 
from Greek or French. In abalone, adobe, cicerone, coyote, expose (‘report 
of scandal’), guacamole, sylloge /'stlad3iz/ <o, e> and the intervening 
consonant letter are all separate graphemes. 
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How should opening be analysed if it is pronounced not /‘aupanin/ 
(where the <e> is pronounced /a/) but /‘aupnin/, with no medial schwa? 
Presumably not as the only instance of a non-word-final split digraph (/au/ 
spelt <o.e>), but as another instance of an elided vowel - see section 6.10. 


10.29 <oi> 


THE MAIN SYSTEM 


Basic phoneme />1/ 100% e.g. boil 


THE REST 
pronounced 
Exceptions to <1% in total 
main system 
<oi> /a/ only in connoisseur, porpoise, tortoise 
<oi> as 2-phoneme only in a few words more recently 


sequence /wa:/ borrowed from French, e.g. bourgeoisie, 
coiffeur/se, coiffure, croissant, pointe, 
Soiree, toilette 


Oddity <ois> /ix/ only in chamois (the leather, pronounced 
/‘Seemi:/ (also spelt shammy), as opposed 
to the animal from whose skin it is made, 
pronounced /'fzmwa:/) 


(Other) 2- and 
3-phoneme 
graphemes 


<oir> as 2-phoneme only in coir 
sequence /)1ja/ 


<oir> as 2-phoneme mainly word-final and only in a very few 
sequence /wa:/ words more recently borrowed from 
French, namely abattoir, boudoir, memoir, 
reservoir, voussoir, non-finally, only 
in avoirdupois. /r/-linking occurs in 
memoirist, noirish - see section 3.6 
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<oir> as 3-phoneme only in choir 
sequence /watia/ 


<oire> as 2-phoneme only word-finally and only in a very 
sequence /wa:/ few words more recently borrowed 
from French, namely aide-memoire, 
conservatoire, escritoire, grimoire, 
repertoire 


<ois> as 2-phoneme only word-finally and only in a very few 
sequence /wa:/ words more recently borrowed from 

French, namely avoirdupois, bourgeois 
(/z/ surfaces in bourgeoisie - see section 
7.2), chamois (the animal, pronounced 
/'‘Semwa:/, as opposed to the leather 
made from its skin, pronounced /'fzemi:/, 
the latter also being spelt shammy), patois 
(contrast fatwa). Except in these words, 
<oi, s> are/belong to separate graphemes, 
e.g. in noise, noisy 


NOTE 


If we follow Crystal (2012: 131-2), ‘more recent’ in terms of loanwords from 
French means after the Great Vowel Shift, which was complete by about AD 
1600. 

<o, i> (with automatic intervening /w/-glide) are separate graphemes in 
coincide, coition, coitus, doing, echoic, echoing, egoism, Eloise, going, heroic, 
heroir(e), jingoism, Lois, oboist, soloist, stoic(al, toing and froing. 


10.30 <oo> 


THE MAIN SYSTEM 


For both categories see Notes. 


Basic /v/ 51% e.g. book, good 
phoneme 
Other /ux/ 46% e.g. ooze, afternoon, baboon, booze, 


phoneme mood, snooker, bamboo, zoo, vamoose 
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THE REST 
pronounced 
Exceptions to <oo> /A/ 3% only in blood, flood 
main system 
<oo> /au/ <1% only in brooch 
Oddities <ooh> /ux/ only in pooh 
<oor> /ue/ only in boor, spoor, and sometimes moor, 
Moor, poor. There is /r/-linking in, e.g., 
boorish - see section 3.6. See section 5.6.5 
for the increasing replacement of /ua/ by 
/d:/ 
<oor> /ox/ only in door, floor, also moor, Moor, poor 


if pronounced to rhyme with door, floor. 
There is /r/-linking in Moorish - see section 
3.6 


2-phoneme (none) 
graphemes 


NOTES 


As the television series for teaching children to read used to say, ‘Look out! 
OO is a double agent!’ (sorry, James). That is, in RP <oo> is pronounced 
both /u/ and /ur/ (never /jux/, however), the two pronunciations are fairly 
evenly balanced in frequency, and a few words can be pronounced with 
either phoneme, e.g. food /fud, furd/, hoodlum /‘hudlam, 'hu:dlam/, room 
/rum, ru:m/, woofer /'wufa, 'wu:fa/ (and in some Scots accents there is no 
such distinction anyway). 

<oo> pronounced /u/ occurs in only about 28 stem words, namely the 
four words just listed plus Chinook, forsook, foot, gooseberry /'guzbri:/, 
hoof (and its plural hooves), poof(ter), soot, woof (/wuf/ ‘barking’; contrast 
woof /wu:f/ ‘weft’), wool, and most words ending in <d, k> with no earlier 
vowel letter: good, hood (plus its use as a suffix, e.g. childhood), stood, wood 
(and its derivative woodbine); book, brook, cook, crook, hook, look, nook, 
rook, shook, took (exceptions: brood, mood, rood, snood, gook, snook, spook, 
stook and the longer words bazooka, gobbledegook, snooker, all with /u:/). 

The set of 12 words just listed with <-ook> pronounced /uk/ (against six 
with /urk/) is one of only five cases where the pronunciation of a phonogram/ 
rime is more predictable as a unit than from the correspondences of the 
separate graphemes, and there are enough instances to make the rule worth 
teaching; see section A.7 in Appendix A. 
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In all words other than those pronounced with /u/ and the three Oddities, 
<oo> is pronounced /u:/. 

<0, o> (always with intervening /w/-glide, but not always with helpful 
hyphen) are separate graphemes in co-op, cooperate, co-opt, coordinate, 
co-own, no-one, spermatozoon and other words ending in <-zoon> (‘living 
thing’), zoology. 


10.31 <or> 


N.B. <ore> has a separate entry. 


THE MAIN SYSTEM 

Basic /d:/ 72% regular before a consonant letter 

phoneme (except another <r>), except in the 
following group and as noted under 
Oddities; for word-final position see the 
Exceptions, and for occurrences before 
a vowel letter see Notes 

Other /3x/ 11% regular after initial <w, wh> and before 

phoneme a consonant letter: whortle(berry), word, 
work, world, worm, worse(n), worship, 
worst, wort, worth(y) (exceptions: 
worn with />:/, worrit, worry with /a/, 
worsted ‘cloth’ with /u/); otherwise only 
in attorney 

THE REST 

pronounced 
Exceptions to <or> /a/ 17% never initial; medially, regular in prefix 
main system <for-> pronounced /fa/, e.g. forbid, forget, 


forgive, forsake (but this is a very small set); 
otherwise rare medially, but cf. Deptford (and 
many other placenames with this element), 
Holborn, scissors, stubborn, regular word- 
finally, e.g. error, horror, orator, sponsor, 
exceptions (all with /3:/): abhor, cantor, condor, 
corridor, cuspidor, décor, for (when stressed), 
grantor, humidor, ichor, lessor, matador, 
mentor, mortgagor, nor, or, praetor, quaestor, 
realtor, tor, toreador, vendor 
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<or> /v/ only in worsted (‘cloth’) pronounced /'wustid/ 
(when pronounced /'w3istid/ it means 
‘defeated’) 
Oddities <orp> /92:/ only in corps (plural), pronounced /k3:z/ 
<orps> /2:/ only in corps (singular), pronounced /k3:/ 
<orr> = /d:/ only in abhorred (in abhorrent, borrow, horrible, 


horrid, torrid <o, rr> are separate graphemes 
pronounced /p, r/; and in worrit, worry <o, rr> 
are pronounced /a, r) 

<ort> /3:/ only in mortgage, rapport. /t/ surfaces in 
rapporteur - see section 7.2 


2-phoneme (none) 
graphemes 


NOTE 


Before a vowel letter, <or> is pronounced /3:/ only in aurora, authorial, borax, 
chlorine, choral, chorus, corporeal, decorum, dictatorial, editorial, euphoria, 
flora), forum, glory, memorial, oracy, oral, oration, oratorio (second <or>), 
orient (noun, ‘The East’, pronounced /'d:ritjant/), quorum, variorum. \n all 
these words, the <r> is both part of the digraph <or> pronounced /):/ and 
a grapheme in its own right pronounced /r/ (for dual-functioning see section 
7.1), and the <or> is stressed (except in oration />:'retfan/). Where the <or> is 
stem-final and the ending is a suffix, /r/-linking also occurs (see section 3.6), 
namely in authorial, dictatorial, editorial, memorial. \n all other cases beforea 
vowel letter, <o, r> are separate graphemes, e.g. in corporation (second <or>), 
decorate, euphoric, florist, memory, orient (verb, ‘align correctly’, pronounced 
/orix'jent/), first <or> in orator, oratorio. For <or> as an elided vowel spelling 
in comfortable see section 6.10. 


10.32 <ore> 


THE MAIN SYSTEM 


Only />:1/ 100% never initial; medially, only in compounds 
phoneme of fore-, of which there are 60+ (only 
(almost) exception: forecastle pronounced 


/‘fauksal/; also pronounced /'fo:karsal/); 
regular word-finally, e.g. carnivore, wore 
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NOTE 


In all other cases, <o, r, e> are separate graphemes, e.g. in anorexia, forest. 


10.33 <ou> 


THE MAIN SYSTEM 


Basic /au/ 48% e.g. about, out, pout, rout 
phoneme 


THE REST 


pronounced 


Exceptions to <ou> /ux/ 29% only in accoutrement, acoustic, ampoule, 

main system barouche, bayou, bijou, bivouac, boudoir, 
boulevard, bouquet, boutique, canteloupe, 
caribou, carousel, cartouche, cougar, coulomb, 
coulter, coupe, coupon, (un)couth, croup, 
croupier, crouton, douche, embouchure, 
frou-frou, ghoul, goujon, goulash, group, 
insouciance, joule, louvre, marabou, moussaka, 
mousse, oubliette, outré, ouzo, pirouette, 
recoup, rouble, rouge, roulette, route, routine, 
silhouette, sou, soubrette, soufflé, soup, 
souvenir, toucan, toupee, troubadour, troupe, 
trousseau, vermouth pronounced /va'mu:8/ 
(also pronounced /'v3:ma@/), voussoir, you 


<ou> /a/ 15% regular in the adjectival ending <-ous> 
pronounced /as/, e.g. anxious, famous. 
Otherwise only in camouflage, limousine, 
moustache, tambourine, vermouth pronounced 
/'v31me@/ (also pronounced /va'mu:8/) 


<ou> /A/ 6% only in chough, Colclough pronounced 
/‘'kaulklaf/ (also pronounced /'kaukli:/), 
country, couple, couplet, courage, cousin, 
double, doublet, doubloon, enough, flourish, 
“hiccough (properly spelt hiccup), housewife 
(‘sewing kit’, pronounced /'hazif/), nourish, 
rough, slough (‘shed skin’), sough, souther-n/ly, 
touch, tough, trouble, young 
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<ou> /3u/ 1% only in boulder, bouquet pronounced 
/bau'ker/ (also pronounced /bu:'ker/), 
moulder/y), moult(ed/ing), poultice, poultry, 
shoulder, smoulder, soul 


<ou> /o/ only in cough, hough, trough 


<ou> /u/ only in courier, pouffe pronounced /puf/ (also 
pronounced /pu:f/) 


<ou> /w/ only in ouija 
Oddities <oue>  /u:/ only in denouement, moue 
On all the <ough> categories see Notes 


<ough> /3:/ 42% of pronunciations of <ough> 
only in bought, brought, fought, nought, ought, 
(be-)sought, thought, wrought 


<ough> /ur/ 27% of pronunciations of <ough> 
only in brougham, through 


<ough> /avu/ 24% of pronunciations of <ough> 
only in dough, furlough, (although 


<ough> /au/ 3% of pronunciations of <ough> 
only in bough, doughty, drought, plough, 
slough (‘muddy place’) 


<ough> /a/ 2% of pronunciations of <ough> 
only in borough, thorough 


<ough> = /i:/ only in Colclough pronounced /'kaukli:/ (also 
pronounced /'kaulklaf/) 


<oul> /v/ only in could, should, would (contrast mould 
/meauld/ - another point in favour of the US 
spelling mold) 

<oup> = /ur/ only in coup 

<our> /o1/ 67% of pronunciations of <our> 


only in court(esan), course, four, mourn, pour, 
source, yours) 


<our> /2/ 25% of pronunciations of <our> 
regular word-finally, e.g. arbour, ardour, 
armour, behaviour, candour, clamour, clangour, 
colour, endeavour, favour, fervour, flavour, 
glamour, harbour, honour, humour, labour, 
neighbour, odour, parlour, rancour, rigour, 
rumour, saviour, splendour, succour, tumour, 
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valour, vapour, vigour. In many of these words 
US spelling has <or>. For exceptions see next 
three paragraphs and the 2-phoneme sequence 


<our> /3:/ 7% of pronunciations of <our> 
only medial, and only in adjourn, bourbon 
(/'b3:ban/ ‘whiskey’), courteous, courtesy, 
journal, journey, scourge, sojourn and tourney 
pronounced /'t3:ni:/ (also pronounced 
/‘tuaniz/) 


<our> /ve/ 1% of pronunciations of <our> 
only in amour, bourbon (/'buabon/ ‘biscuit’), 
bourgeois(ie), bourse, contour, detour, dour 
pronounced /dus/ (also pronounced /'dauwa/), 
entourage, gourd, gourmand, gourmet, 
houri, mourn (e.g. in mourning pronounced 
/‘muantn/ to distinguish it carefully from 
morning pronounced /'mo:nt1n/), potpourri (if 
we take the second <r> as spelling /r/), tour, 
tournament, tourney pronounced /'tuani:/ 
(also pronounced /'t3:ni:/), tourniquet, 
troubadour, velour. There is /r/-linking in, 
e.g., touring - see section 3.6, and in entourage, 
houri the <r> is both part of grapheme <our> 
and a grapheme in its own right spelling /r/. 
For dual-functioning see section 7.1. See 
section 5.6.5 for the increasing replacement of 
/v9/ by /3:/ 


<ou’re> /31/ only in you’re. See section A.9 in Appendix A 
<ous> /ux/ only in rendezvous 
<out> /ux/ only in mange-tout, ragout, surtout 
<oux>  =/ur/ only in billet-doux, roux 
2-phoneme <our> as in devour, flour, lour, our, ours, scour, Sour and 
grapheme 2-phoneme dour pronounced /'dauwa/ (also pronounced 
sequence /dua/) 
/auwa/ 


NOTES 


<ou, r> are separate graphemes in courage, flourish, nourish. 
For <ou> as an elided vowel spelling in favourable, honourable see 
section 6.10. 
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The six categories of <ough> listed above are those where it is a four- 
letter grapheme pronounced as a single phoneme, and the percentages 
given are for those circumstances. In other cases <ou, gh> are separate 
graphemes with separate pronunciations. For completeness the six 
2-phoneme pronunciations of <ough> are listed here in the same manner 
as single-phoneme pronunciations: 

<ough> pronounced /vf/ only in cough, trough 

<ough> pronounced /ok/ only in hough 

<ough> pronounced /px/ only in (Irish) lough /lox/ 

<ough> pronounced /af/ only in chough, Colclough pronounced 

/‘kaulklaf/, enough, slough (‘shed skin’), sough, tough 

<ough> pronounced /ap/ only in the (mis)spelling of hiccup as “hiccough 

<ough> pronounced /ax/ only in McCullough pronounced /ma'kalax/ 
Thus the 33 words containing <ough> have 12 pronunciations between 
them. The only semblance of a rule is that most of the words containing 
<-ought> (bought, brought, fought, nought, ought, sought, thought, 
wrought) are pronounced /d:t/, the only two exceptions being doughty, 
drought with /aut/. Note that two of the 2-phoneme pronuncations (/ox/ 
in lough, /ax/ in McCullough) do not occur in English stem words, and are 
therefore included here only for interest - they do not appear in my main 
lists of correspondences. See also Notes to section 9.15. 


10.34 <ow> 


THE MAIN SYSTEM 


Basic phoneme /au/ 45% e.g. allow, brown, cow, coward, how, 
owl 

Other phoneme /au/ 44% regular word-finally after <I, r>. See 
Note 

THE REST 

pronounced 
Exceptions to main <ow> /b/ 10% only in (ac)knowledge, rowlock 
system 
<ow> /a/ <1% only in Meadowhall (locally, in Sheffield), 


sorrowful 


The grapheme-phoneme correspondences, 2 419 


Oddity <owe> /au/ only in owe 
2-phoneme (none) 

graphemes 

NOTES 


/au/ is the regular pronunciation word-finally after <I, r>: bellow, below, 
billow, blow, bungalow, callow, fallow, fellow, flow, follow, furbelow, glow, 
hallow, hollow, low, mallow, mellow, pillow, sallow, shallow, slow, swallow, 
tallow, wallow, whitlow, willow, yellow; arrow, barrow, borrow, burrow, 
crow, escrow, farrow, furrow, grow, harrow, marrow, morrow, narrow, row 
/rau/ (line, use oars’), sorrow, sparrow, throw, yarrow (only exceptions: 
allow /a'lau/, plow; brow, prow, row /rau/ ‘squabble’), trow). 

Otherwise /au/ occurs only in: (word-finally) bestow, bow (goes with 
arrow; contrast bow /bau/ ‘incline deferentially’), elbow, know, meadow, 
minnow, mow, shadow, show, snow, sow (‘plant seed’; contrast sow /sau/ 
‘female pig’), stow, tow, widow, window, winnow; (medially) bowl, own and 
the irregular past participles blown, grown, thrown, which derive from verbs 
listed above, plus flown, known, mown, shown. 

All other occurrences of <ow> (bar the exceptions) are pronounced /au/. 


10.35 <oy> 


THE MAIN SYSTEM 

Basic />1/ 100% e.g. boy 

phoneme 

THE REST 

pronounced 

Exception to <oy> /at/ only in coyote. The <y> is both part of 

main system <oy> and a grapheme in its own right 
pronounced /j/. For dual-functioning see 
section 7.1 

Oddities (none) 

2-phoneme <oy> as 2-phoneme only in foyer pronounced /'fwarjer/ (also 


grapheme sequence /waiI/ pronounced /'foijet, 'fotja/), voyeur 
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NOTE 


In medial examples of <oy> pronounced /31/ before a vowel letter, namely 
in arroyo, employee, foyer pronounced /'foijer, 'faija/, loyal, royal, soya, 
voyage and, | suppose, coy-er/est, comparative and superlative of coy, the 
<y> is both part of <oy> spelling /31/ and a grapheme in its own right 
pronounced /j/. For dual-functioning see section 7.1. 


10.36 <u> 


N.B. <ue, u.e, ur> have separate entries. 


THE MAIN SYSTEM 


On all these categories except /w/ see Notes. 


Basic phoneme /A/ 44% — e.g. but, up; regular in prefix un- 


Other phonemes /vu/ 6% in RP, only in 50+ stem words, but 
many are very frequent; regular in suffix 
<-ful> 


/ux/ 3% e.g. ruby 


/w/ <1%_ regular after <q> pronounced /k/ 

(for exceptions, see under <cqu, qu, 
que> in sections 9.7, 9.27); also found 
in a few words after <c, g, S, ss, Z>, 
namely cuirass, cuisine, cuisse; anguish, 
distinguish, extinguish, guacamole, 
guano, guava, iguana pronounced 
/tgwatna/, language, languish, linguist, 
penguin, sanguine, segue, unguent; 
persuade, pueblo, puissan-ce/t, 
pursuivant, suave, suede, suite; assuage, 
dissuade; Venezuela and some very rare 
words; otherwise perhaps only in ennui, 
etui /on'wir, e'twi:/ 

Frequent /jux/ 22% e.g. pupil, union; word-final only in 

2-phoneme coypu, menu, ormolu, parvenu 

sequence 


THE REST 

Exceptions to main <u> 

system 
<u> 
<u> 
<u> 

Oddities <ua> 
<ui> 
<ui> 
<uu> 


Other 2-phoneme <ua> 


graphemes 
<ui> 
<ut> 
<uu> 
NOTES 
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pronounced 


/2/ 


/1/ 


/e/ 


as 2-phoneme 
sequence /ja/ 


/2/ 


as 2-phoneme 
sequence /ja/ 


as 2-phoneme 
sequence /ju:/ 


as 2-phoneme 
sequence /ju:/ 


as 2-phoneme 
sequence /ju:/ 


10% regular when unstressed. See Notes 


2% only in busy, business, lettuce, minute 
(noun /'mintt/, ‘60 seconds’), missus 


<1% only in burial, bury 


14% in some words when unstressed See 
Notes 


in nouns, only in actuary, estuary, 
mortuary, obituary, sanctuary, statuary, 
voluptuary, when pronounced with /tfari:/ 
rather than /tfuari:/ (see also under /tf/, 
section 3.7.2), plus casualty /‘kezalti:/, 
February /'febrari:/, victuals /'v1talz/; 
also often in rapid pronunciation of 
adjectives like actual (see /t{/, section 
3.7.2), sexual and especially adverbs 
derived from them. See Notes 


only in bruise, bruit, cruise, fruit, juice, 
recruit, sluice, suit. See Notes 


only in duiker, Ruislip 
only in muumuu (twice) 


only in January, valuable 


only in nuisance, pursuit 


only in debut. /t/ surfaces in debutante - 


see section 7.2 


only in vacuum pronounced /'vekju:m/ 


The consonantal pronunciation of <u> as /w/ is dealt with above. It 


is curious that the consonantal 


and vocalic pronunciations of <u> 


never occur adjacently, i.e. there are no instances of <uu> pronounced 


/WA WU WU: Wea WI we/ or any of those with a /j/ glide between the two 
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phonemes. This is despite the fact that at least one Latin word with such 
a sequence (equus /'ekwus/, ‘horse’) has various English derivatives - but 
they all have /e/ spelt <e> after /w/ spelt <u>. Where sequences such as 
/wa/ occur in English the /w/ is always spelt <w> and the vowel is rarely 
spelt <u> - the only words beginning <wu> appear to be wunderkind, 
wuss with <u> pronounced /u/, and Wurlitzer with <ur> pronounced /3:/. 

For instances of <u> as an elided vowel see section 6.10. 

Except in the 10 words listed under Oddities, <u, i> always are/belong 
to separate graphemes, e.g. in several words listed under <u> pronounced 
/ux, ju:/ below, including in particular circuitous, fruition (with intervening 
/w/-glide), plus words where <u> is part of a digraph with the preceding 
consonant letter: biscuit, build, cataloguing and a few more words with 
potential <e>-deletion from <-gue> before <-ing>, circuit, guide, guild, 
guilder, guile, guillemot, guillotine, guilt, guinea, (dis)guise, guitar, suite. 

In RP (as distinct from local accents of the north of England, in which /u/ 
is much more frequent), <u> is pronounced /u/ in only about 57 stem words: 
ambush, Buddha, buffet /‘bufe1/ (‘food’), bulbul (twice), bull, bullace, bullet, 
bulletin, Bullingdon, bullion, bullock, bully, bulrush (first <u>), bulwark (also 
pronounced with /A/), bush, bushel, butch, butcher, cuckoo, (mea) culpa, 
cushion, cushty, cushy, ebullient (also pronounced with /a/), fulcrum (both 
<u>’s), full, fulmar, fundi(/'fundi:/ South and East African English for ‘expert / 
skilled person’/in Britain, a member of the fundamentalist, uncompromising 
wing of the German Green Party), gerenuk, kaput, kibbutz, kukri, lungi, 
lutz, mullah, mush (/mvf§/, slang for ‘friend’), muslim, Musulman (twice), 
umlaut (first <u>), Zumba, pud, pudding, pull, pullet, pulpit, push, puss, put, 
putsch, schuss, s(dhtum, shufti, sputnik, sugar, suk, Sunni, thurible, thurifer, 
thruppence, tuk-tuk (twice), plus derivatives including Buddhism, bullock, 
fulfil, fully, ful\)ness, fulsome, and in the adjective/noun suffix <-ful> - 
there are at least 150 words so formed, e.g. beautiful, handful. Unstressed 
in that suffix but stressed in all other cases except ambush, fulcrum (second 
<u>), fulfil, gerenuk, tuk-tuk (second <u>). 

In RP (as distinct from local accents of the north of England, in which /a/ 
does not occur) <u> is pronounced /a/: 

regularly before geminate and doubled consonant spellings, e.g. bubble, 
bucket, duck, muddle, rudd, cudgel, judge, bluff, buffalo, muggle, gull, 
ullage, unnecessary, supper, curry, cussed (‘stubborn’), fuss, hutch, 
(e)scutcheon, butter, putt, puzzle. Exceptions: bull, bullet, Bullingdon, 
bullion, bullock, butch, butcher, cuckoo, ebullient (also pronounced 
with /A/), full, fully, mullah, pudding, pull, both pronunciations of 
stumm, puss, putsch, thruppence, with /u/ 
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regularly in other words where it is the only vowel letter and non- 
final, e.g. bulk, brush, crux, dumb, dung, flux, hulk, just, mud, 
mush (‘squashy mess/command to husky’), plump, sculpt, sulk, up. 
See section 11.3 for a teaching rule relevant to ..VC monosyllables. 
Exceptions: bush, mush (‘friend’), pud, pushwith /u/, ruth, truth with 
/ux/ (for brusque see under <u.e>, section 10.38) 
in the prefix <sub-> when stressed, e.g. in subject (noun, pronounced 
/'sAbdgrkt/), sublimate, subterfuge, etc. 
regularly where it is the last vowel letter in the word and non-final 
and stressed, e.g. abrupt, adjust, annul, begun, robust, rotund. Only 
exception: impugn, with /jur/ 
mostly otherwise before two or more consonant letters or <x> where 
there is at least one later vowel letter, e.g. blunder, butler, divulge, 
dungeon, fundi (/'fandat/, plural of fundus ‘inner corner of organ’), 
hundred, husband, inculcate, indulge, indulgence, presumption, 
promulgate, sunder, truncate, truncheon, tuxedo, ulterior. Exceptions: 
duplicate, duplicity, fuchsia, hubris, lubricate, lucrative, lucre, 
nutritious, putrid, rubric and the prefix <supra->, with /(j)ur/ 
in a ragbag of other stem words, e.g. bunion, ketchup, punish, study, 
triumph, viaduct 
in the native English prefix <un-> meaning ‘not’. 
Unlike the other vowel letters as single-letter graphemes, <u> is not 
pronounced short, i.e. /A/, before a consonant and word-final <-ic(al)>. 
Instead it is pronounced /(j)ur/, e.g. cubic, music, punic, runic, tunic - see 
below. 

A test for distinguishing the (Germanic) prefix <un-> ‘not’ pronounced 
/an/ from the (Latin) initial element <un(i)-> ‘one’ pronounced /jurn(iz)/ 
which seems mainly reliable is this: Remove <un>. If what remains is a word, 
it is <un-> pronounced /an/; if what remains is not a word, it is <un(i)-> 
pronounced /jurn(iz)/. For example, uninformed has /an/; uniformed has 
/jutnix/. There appear to be only two words for which this does not work: 
union, unit, but neither is likely to be misunderstood, there being no words 
*un-ion ‘not an ion’, “un-it ‘not an it’. However, based on un(-)ion is one of 
the longest homographs in English: unionised, which is either union-ised 
‘belonging to a trade union’ or un-ionised ‘not converted into ions’. 

<u> is pronounced /u:/: 

word-finally, only in ecru, flu, guru, impromptu, juju, plus gnu if <gn> 
is analysed as pronounced /nj/ 

in words where it is the only vowel letter and is followed by a consonant 
letter: only in ruth, truth 
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in suffixed forms of stem words in <-u.e> pronounced /u:/ after 

<e>-deletion (sometimes with change of stem-final consonant), e.g. 

brutal, crudity, inclusive, intrusion, reclusive, runic, secluded, trucial, 

plus truly - in all these cases, the preceding letter is <I> or <r> 

in a small set of other words where there is at least one later vowel 

letter, mostly after <I, r>, e.g. (af)fluent, alluvial, bruin, cruel, fluid, 

fluorescent, frugal, fruition, gluten, inscrutable, lucrative, lucre, 

ludicrous, luna-cy/tic, lunar, lupus, prudent, rubric, ruby, ruin, runic, 

scruple, scrutiny, solution, truant, plus judicial, judo, jujitsu, suicide, 

superior, also in casual, sexual, usual, visual pronounced /'kezu:wal, 

‘sekfurwal, ‘jurzurwal, 'vizurweal/. Where the letter following <u> is a 

vowel, the pronunciation has an intervening /w/-glide. 
Wijk (1960: 15) points out that /ur/ is regular after /dg, r, f, j/ (mainly 
spelt <j, r, ch/sh, y> and after /I/ spelt <I> after another consonant, both 
when <u> is a single-letter grapheme and in <u.e>. | would add that in 
current RP /u:/ is also regular after <d, t> pronounced /d3, tf/, e.g. in 
arduous, assiduous, deciduous, dual, ducal, duel, duet, duly, duty, gradual, 
graduate, individual, residual, tuba, tuber, tulip, tumour, tumult(uous), 
tumulus, tuna, tunic, tureen, tutor; attitude, multitude, solitude, costume; 
fortune, importune, opportune; virtuoso; contemptuous,  fatuous, 
impetuous, incestuous, perpetuate, spirituous, sumptuous, tempestuous, 
tortuous, tumultuous, unctuous, virtuous, voluptuous; obtuse, de/in/pro/ 
re/sub-stitution. accentual, actual, conceptual, contractual, effectual, 
eventual, factual, habitual, intellectual, mutual, perpetual, punctual, ritual, 
Spiritual, textual, virtual, actuary, estuary, mortuary, obituary, sanctuary, 
Statuary, voluptuary. Again, where the letter following <u> is a vowel, the 
pronunciation has an intervening /w/-glide. 

<u> is pronounced /jur/: 

word-finally, only in coypu, menu, ormolu, parvenu, plus gnu if <gn> 

is analysed as pronounced /n/ 

in words where it is the only vowel letter and is followed by a consonant 

letter: only in impugn 

mostly before a consonant letter and word-final <-ic(al)>, e.g. cubic, 

music, punic, tunic (exception: runic) 

in suffixed forms of stem words in <-u.e> pronounced /ju:/ after 

<e>-deletion (sometimes with change of stem-final consonant), e.g. 

accusation, allusion, at/con/dis/re-tribution, collusion, communal, 

community, computation, con/in-stitution, consuming, delusion, 
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disputacious, enthusiasm, elocution, evolution, execution, funeral, 
(con/dif/in/pro/trans-)fusion,  (dis)illusion, nudity, persecution, 
pollution, prosecution, reducible, reputation, revolution, usage 
ina large set of other words where there is at least one later vowel letter, 
e.g. annual, computer, continuity, cubicle, cubit, duplicate, duplicity, 
fuchsia, fuel, genuine, hubris, human, humus, impecunious, ingenuity, 
lubricate, (pellucid, mucus, mutate, numerous, nutritious, peculiar, 
puny, putrid, student, stupid, tenuous, the prefix <supra-> and many 
words with the (Latin) initial element <un(i)-> (‘one’), e.g. unanimous, 
unicorn, union, unison, unit, universe. See above on distinguishing 
words with <un(i)-> from those with <un-> pronounced /an/, the 
native English prefix meaning ‘not’. Where the letter following <u> is 
a vowel, the pronunciation has an intervening /w/-glide. 

<u> is pronounced /2/: 
in a set of words containing <du, tu> pronounced /dga, tfa/ when the 
<u> is the penultimate vowel grapheme in the word and unstressed, 
and separated from the next vowel letter by a single consonant 
letter, and the main stress is on the preceding syllable: (in)credulous, 
educate, glandular, modular, nodular, pendulum, sedulous; century, 
congratulate, fistula, fortunate, naturist, petulant/ce, postulant, 
postulate, saturate, spatula, titular and derivatives, e.g. education, 
saturation (cf. words with /ja/, below) 
in all occurrences of the endings <-ium, -ius> (with intervening 
/j/-glide), e.g. atrium, bacterium, compendium, delirium, geranium, 
gymnasium, medium, opium, potassium, radium, stadium, tedium and 
about 200 others ending in <-ium>; genius, radius 
in all occurrences of the endings <-um, -us> without a preceding <i>, 
e.g. album, agendum, carborundum, colosseum, linoleum, lyceum, 
mausoleum, maximum, museum, petroleum, rectum, referendum, 
abacus, anus, bogus, bonus, cactus, campus, Caucus, census, chorus, 
circus, citrus, corpus, crocus, discus, emeritus, exodus, focus, fungus, 
genus, hiatus, hippopotamus, isthmus, litmus, lotus, octopus, onus, 
nucleus, rhombus, stimulus, surplus, syllabus, Taurus, terminus, 
tinnitus, virus and hundreds more 
in prefix <sub-> when unstressed, e.g. subdue, subject (verb, 
pronounced /sab'dgekt/), sublime, submerge, submit, subside, subsist, 
substantial 
otherwise in, e.g., cherub, catsup, chirrup, stirrup, syrup. 
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Also, in the entry for <ur>, section 10.39, reference is made to the long 
list in section 5.4.7 of nouns ending in <-ture> pronounced /tfa/. In 
adjectives derived from nouns in that list, e.g. adventurous /ad'vent{faras/, 
natural /‘nztfaral/), and especially in adverbs derived from those adjectives, 
e.g. adventurously, naturally, <u> may be pronounced /a/ - or in rapid 
pronunciation the schwa may be absent (/ad'ventfras(li:), ‘naetfral(i:)/), in 
which case the <u> is elided - see section 6.10. | think that the tendency for 
the vowel to disappear in rapid speech is stronger in the adverbs alluded to 
in this paragraph and listed in section 5.4.7 than in the adjectives. 

<u> is pronounced /ja/ in several words where it is the penultimate 
vowel grapheme and unstressed, and separated from the next vowel letter 
by a single consonant letter, and main stress is on the preceding syllable, 
e.g. amulet, angular /‘zngjala/, argument, calculate, chasuble, coagulate, 
contributor, corpuscular, distributor, emulate, fabulous, garrulous, 
immunise, inaugural, incubus, insula-r/te, jugular, manipulate, muscular, 
nebulous, particular, penury, popul(o)us, querulous, regula-r/te, scapula(n), 
scroful-a/ous, scrupulous, stimul-ant/ate/us, succubus, tremulous, truculent, 
vernacular, also in, e.g. glandular, spatula, if pronounced with /dja, tja/ 
rather than /dga, t{a/ (see list above); also in the two words copulation, 
population where it is the antepenultimate vowel grapheme (and unstressed) 
and main stress is on the following syllable. 


10.37 <ue> 


N.B. <u.e> has a separate entry. 
Does not occur initially. Except in gruesome, muesli, Tuesday, only 
word-final. 


THE MAIN SYSTEM 


For both categories see Notes. 


Basic phoneme /ur/ 41% e.g. glue 
Frequent 2-phoneme sequence /jur/ 59% e.g. cue 
THE REST 


(None). 
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NOTES 


This grapheme is not to be confused with word-final <-ue> in <gue, que>, 
where it is sometimes part of those graphemes - see sections 9.15, 9.27. 

/u:/ is regular after <I, r>, namely in blue, clue, flue, glue, slue; accrue, 
construe, gruesome, imbrue, rue, sprue, true, and predominates after <d, 
t> (where older pronunciations with /jur/ are still sometimes heard): due, 
residue, subdue; statue, Tuesday pronounced /'tfu:zdi:/, virtue, plus issue, 
sue, tissue. Only definite exception: value, with /jur/. 

/jur/ is regular in almost all other cases, namely ague, argue, avenue, 
barbecue, continue, cue, curlicue, ensue, hue, imbue, pursue, queue, rescue, 
retinue, revenue, revue, value, venue. Exception: muesli. 

Except in gruesome, muesli, Tuesday, <u, e> are always separate 
graphemes in medial position, e.g. cruel /‘kru:wal/, duel /'dgju:wal/ 
(homophonous with jewel, duet /dgu:'wet/ (words like these three have 
an intervening /w/-glide), suede /sweid/ (where <u> spells /w/ anyway). 
There is also one 2-grapheme exception in final position: segue /'segwe1/ 
(where <u> again spells /w/). 


10.38 <u.e> 


Occurs only where the <e> is word-final. 
See Notes for both categories and for how this split digraph is defined, and 
see section 11.4 for a teaching rule relevant to all split digraphs except <y.e>. 


THE MAIN SYSTEM 


For both categories see Notes. 


Basic phoneme /ux/ 11% e.g. rude 
Frequent 2-phoneme pronunciation /jur/ 89% e.g. cute 
THE REST 

(None). 

NOTES 


The split digraph <u.e> is defined as covering words where the <e> is 
separated from the <u> by one consonant letter other than <r, x> and the 
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<u> is not preceded by a vowel letter and the digraph is pronounced /u:/ 
or /jur/. The definition covers both words where the intervening consonant 
letter is an independent grapheme and words where the <e> is also part of 
a split digraph <ce, ge> - see sections 3.7.4, 3.7.6 and 3.8.4, and section 
7.1 for dual-functioning. 

The only extensions needed are to cover five words with two intervening 
letters forming consonant digraphs: butte, fugue, peruque, ruche, tulle, 
plus brusque pronounced /brursk/ (also pronounced /brask/), with three 
intervening letters (including <qu> as a digraph) forming the consonant 
cluster /sk/. The only exceptions appear to be /ettuce, minute (/'mintt/, 
‘60 seconds’), with <u> pronounced /1/ and <ce, te> forming digraphs 
pronounced /s, t/, and deluxe with <u> pronounced /a/ and <xe> forming 
a 2-phoneme digraph pronounced /ks/. See also section A.6 in Appendix A. 

/u:/ is regular after <ch, j, |, r>, namely in (para)chute; June, jupe, jute; 
fluke, flume, flute, include and various other words in <-clude>, luge, 
lute, plume, recluse; abstruse, brume, brute, crude, intrude and various 
other words in <-trude>, peruque, peruse, prude, prune, ruche, rude, 
rule, rune, ruse, spruce, truce, and predominates after <d, t> (where 
older pronunciations with /ju:x/ are still sometimes heard): duke, dune 
(homophonous with June), introduce, reduce, module, nodule; tube, tulle, 
tune. Exceptions: delude, mameluke, pollute, with /jux/. 

/ju:/ is regular in almost all other cases, e.g. abuse, accuse, amuse, (at/ 
con/dis-)tribute, centrifuge, commune (noun and verb), compute, consume, 
delude, deluge, dispute, enthuse, globule, huge, minute /mat'nju:t/ (‘tiny’), 
mule, mute, nude, perfume, pollute, refuge, repute, subterfuge, use (noun 
and verb). 

There are very few English words ending <-uge>: centrifuge, deluge, 
huge, refuge, subterfuge and a few more rarities, all with /ju:d3/, plus /uge 
with /ur3/ (there are none with /urd3, jur3/). 

The only word in which a final <e> after <u>+consonant is ‘pronounced’ 
rather than ‘silent’ appears to be resume (‘c.v.’). 


10.39 <ur> 


THE MAIN SYSTEM 


Basic phoneme /3x/ 70% e.g. fur, occur, turn, urgent. See Notes 
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THE REST 


pronounced 


Exceptions to <ur> /a/ 


main system 


<ur> /vea/ 


<ur> as 2-phoneme 


sequence /jua/ 


30% never initial; word-finally, only in 
augur, femur, langur, lemur, murmur, 
sulphur, medially, regular in prefixes 

pur-, sur- when unstressed, e.g. purgation, 
purloin, purport, pursue, purvey, surmise, 
Surmount, surpass, surprise, survey (verb), 
Survive; otherwise only in a few words, 

e.g. auburn, expurgate, jodhpurs, liturgy, 
metallurgy, Saturday, saturnine. See Notes 


<1% never word-final; initially, only 

in urtext, otherwise only medial and 
only in centurion, durable, (en)during, 
duress, injurious, juror, jury, prurient/ 
ce, rural, usurious, plus luxuriance, 
luxuriant, luxuriate, luxurious (/lag'3uaritj- 
ans/ant/ert/as/), maturity, tureen and 
derived forms of some words ending 

in <-ure> pronounced /ua/ (see below) 
after <e>-deletion, e.g. insurance. In all 
these medial cases the <r> is both part 
of <ur> and a grapheme in its own right 
pronounced r/. For dual-functioning see 
section 7.1. See also Notes 


<1% never word-final; initially, only in 
urea and various words derived from it, 
e.g. urethra, urine, urology, otherwise 
only medial and only in bravura, curate 
(both the noun ‘junior cleric’ pronounced 
/‘kjuerat/ and the verb ‘mount an 
exhibition’ pronounced /kjua'reit/), curious, 
furore whether pronounced /fjua'ro:re1/ 
or /'fjuers:/, furious, fury, lurid, mural, 
purify, purity, security, spurious and 
derived forms of some words ending in 
<-ure> pronounced /jua/ (see below) 
after <e>-deletion, e.g. manuring. |n all 
these medial cases the <r> is both part 
of <ur> and a grapheme in its own right 
pronounced /r/. For dual-functioning see 
section 7.1. See also Notes 
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Oddities 


<ure> 


<ure> 


<urr> 


Other 2-phoneme <ure> 
pronunciations 


/a/ 


/va/ 


/31/ 


as 2-phoneme 
sequence /ja/ 


All word-final only in stem words 


the regular pronunciation of <ure>, e.g. in 
lecture, nature and dozens of other words 
ending in unstressed <-ture> (for a long 
list see section 5.4.7), censure, conjure (‘do 
magic tricks’) pronounced /'kandga/, figure, 
injure, leisure, measure, perjure, pleasure, 
pressure, procedure, tonsure, treasure, 
verdure (cf. verger). For exceptions see 

the next paragraph and the 2-phoneme 
graphemes. Many of these words allow 
/r/-linking (see section 3.6), e.g. natural, 
pleasurable, procedural 


only in abjure, adjure, assure, brochure, 
(also pronounced with final /a/), conjure 
(‘summon with an oath’) pronounced 
/kan'djua/, cynosure, embouchure, ensure, 
insure, sure; also caricature, overture if 
<-ture> is pronounced /tfua/ rather than 
/t{a/, and words like endure, mature if 
<-dure, -ture> are pronounced /dgua, tfuea/ 
(see two paragraphs below and sections 
9.12 and 9.33). Many of these words allow 
/r/-linking (see section 3.6), e.g. assurance, 
maturity. See also Notes 


only in burr, purr and suffixed forms of 
words ending in <-ur>, e.g. blurred, furry, 
demurring, occurred. /r/-linking occurs 

in furry, demurring - see section 3.6. <u, 
rr> are separate graphemes pronounced 

/A, t/ in, e.g., demurral, furrier pronounced 
/’farisja/ ‘dealer in furs’ (contrast the word 
of the same spelling pronounced /’f3:ri:ja/ 
‘more furry’), hurry, scurrilous, scurry, slurry 


only in failure, tenure and azure 
pronounced /'xzja, 'e1zja/ (also 
pronounced /'xzjua, 'e1zjua, '23a, 'e1Za/); 
also possibly in words like endure, mature 
if <-dure, -ture> are pronounced 
conservatively with /dj, tj/ not yet affricated 
to /d3, t{/ (see two paragraphs above and 
sections 9.12 and 9.33). See also Notes 


<ure> 


NOTES 
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as 2-phoneme only in coiffure, cure, demure, immure, 
sequence /jua/ inure, lure, manure, photogravure, pure, 


secure, sinecure. See also Notes 


There are very few words with initial <ur->. Most are derivatives of urea, 
all with a following vowel letter and with <ur> pronounced /jua/ and <r> 
also pronounced /r/, i.e. dual-functioning (see section 7.1). There are only 
six words with a following consonant letter: urbane, urchin, urge, urgent, 
urn with regular /3:/ and urtext with /ua/. Except in urtext, urea and its 
derivatives, and the exceptions and the Oddity <urr> noted above, <ur> 
is always pronounced /3:/, and there appear to be no cases of <u, r> as 


separate graphemes. 


Despite the high percentage for <ur> pronounced /a/ | have not counted 
it as part of the main system because of the rarity of its converse - see 


section 5.4.7. 


See section 5.6.5 for the increasing replacement of /(j)ua/ by /(j)>:/. 


10.40 <y> 


THE MAIN SYSTEM 


For all these categories and the absence of percentages see Notes, and for 
a teaching rule relevant to word-final <y> see section 11.6. 


Basic word-initial /j/ 
phoneme 


Basic phoneme /at/ 
elsewhere 


Other phonemes _ /i:/ 


/1/ 


e.g. yellow, you, your; never occurs word- 
finally; rare medially, where almost all 
occurrences are vocalic 


regular word-finally where it is the only vowel 
letter, in the suffix <-fy>, and medially after 
<e>-deletion, e.g. fly, beautify, stylish 


usual in prefix poly- and regular word-finally 
where there is at least one earlier vowel letter 
(except in the suffix <-fy>), e.g. polytechnic, 
city, happy 


never word-final; almost exclusively medial; 
occurs in many words of (mainly) Greek origin, 
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THE REST 


Exceptions to main 
system 


Oddities 


2-phoneme graphemes 


<y> 


<ye> 


<y.e> 


<yr> 


<yr> 


e.g. bicycle, crystal; regular where it precedes a 


consonant letter and word-final <-ic(al>), and 


mainly before consonant clusters 


pronounced 


/a/ 


/at/ 


/at/ 


/a/ 
/31/ 


<yrrh> /3:/ 


<yr> 


as 2-phoneme 
sequence /ata/ 


only in pyjamas 


only word-final, and only in (good)bye, 
dye, lye, rye, Skye, stye 


only where the <e> is word-final and 
only in: acolyte, analyse, anodyne, 
azyme, breathalyse, byte, catalyse, 
chyle, chyme, coenocyte, condyle, 
dialyse, dyke, dyne, electrolyse, 
electrolyte, enzyme, formaldehyde, gybe, 
gyve, hythe, hype, leucocyte, neophyte 
and at least 14 other words ending in 
<-phyte> pronounced /fait/, paralyse, 
phagocyte, proselyte, rhyme, scythe, 
spondyle, style and about 20 derivatives, 
troglodyte, syce, thyme, tyke, type and 
at least 20 derivatives; also alternative 
US spellings such as analyze. See Notes 
for how this split digraph is defined 


only in martyr, satyr, zephyr 
only in gyrfalcon, myrmidon, myrtle 
only in myrrh 


only medial and only in empyrean, 
gyroscope, papyrus, pyrites, 
pyromaniac, thyroid, tyrant, tyro, 
tyrosine. |n all cases the <r> is both 
part of the digraph <yr> pronounced 
/ata/ and a grapheme in its own right 
pronounced /r/. For dual-functioning 
see section 7.1. Words in which <y, 
r> are separate graphemes include 
dithyramb(ic), myriad, porphyr-y/ia, 
syringa, syringe, syrup, tyranny, all with 
the relevant <y> pronounced /1/ 
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<yre> as 2-phoneme — only word-final and only in byre, gyre, 
sequence /aia/__ lyre, pyre, tyre. Some of these allow 
/r/-linking - see section 3.6 - e.g. 
pyromaniac and (with change of vowel 
and <r> spelling only /r/) lyrical 


NOTES 


Gontijo et al. (2003) (like Carney - see section 5.4.3) analyse word-final <y> 
(except where it is the only vowel letter and in <-fy>) as pronounced /1/. 
Because | instead analyse it as pronounced /i:/ and can not separate their 
final <y> pronounced /1/ from medial <y> pronounced /1/ | am unable to 
use their percentages for any of the correspondences of <y>. 

Initial <y> is always pronounced /j/ before a vowel letter. Cases of initial 
<y> followed by a consonant letter are very rare, but in all of them <y> 
is pronounced /1/, namely the archaic word yclept (‘named’), the type of 
boat called yngling, and the names of the plant and essential oil ylang- 
ylang (also spelt ilang-ilang) and of the elements ytterbium, yttrium and the 
names Yvette, Yvonne. 

Conversely, there are cases of medial <y> which are consonantal and 
are pronounced /j/. In a few, the <y> is solely a single-letter grapheme: 
banyan /'benjen/, beyond, biryani, bowyer /'bauja/, canyon /'kenjan/, 
halyard, lanyard, vineyard, yoyo. \n rather more the <y> functions both as 
a single-letter grapheme pronounced /j/ and as part of digraphs (for dual- 
functioning see section 7.1) with various pronunciations: 

e1/ in abeyance, bayonet, cayenne, layer, layette, mayonnaise, prayer 
pronounced /'preija/ (‘one who prays’), rayon; also in derived forms 
such as betrayal, conveyance 
/>1/ in arroyo, buoyant /‘bo1jant/, doyen pronounced /'‘doijan, do1'jen/, 
doyenne pronounced /do1'jen/, foyer pronounced /'‘foijer, ‘foija/, 
joyous, loyal, royal, soya; also in derived forms such as joyous 
/at/ in coyote /kat'jauti:/, kayak /'‘katjek/ 
/wat/ in doyenand doyenne pronounced /dwar'jen/, foyer pronounced 
/‘fwaijer/, voyeur /vwat'j3x/. 
Consonantal <y> and initial vocalic <y> having been dealt with, the main 
question is how to predict the three main vocalic pronunciations in medial 
and final positions. 
<y> is pronounced /at/: 
word-finally where it is the only vowel letter, namely by, cry, dry, fly, 
fry, my, ply, pry, scry, shy, sky, sly, spy, spry, sty, thy, try, why, wry, 
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plus buy, guy (taking <bu, gu> to be digraphs spelling /b, g/). No 
exceptions 
in the suffix <-fy>, e.g. beautify, classify, notify, including the three 
words with a preceding <e>: liquefy, putrefy, stupefy. The noun 
salsify (‘root vegetable’) is pronounced /'selsifi:/ but is not a real 
exception because here <fy> is not a suffix 
word-finally also in a few other words where it is not the only vowel 
letter: ally, ap/com/im/re/sup-ply, awry, defy, deny, descry, espy, 
July, multiply (verb - contrast the adverb, with /i:/), occupy, prophesy 
(contrast prophecy, with /i:/), rely 
medially in hundreds of words where <e>-deletion has occurred, e.g. 
stylish, typist 
medially otherwise in an unpredictable ragbag of words, including 
asylum, aureomycin, cryostat, cyanide, cycle, cyclone, cypress, (hama) 
dryad, dynamic, forsythia, glycogen, gynaecology, hyacinth, hyaline, 
hybrid, hydra, hydrangea, hydrant, hydraulic, hydrofoil, hydrogen 
and various other compounds of hydro-, hyena, hygiene, hygrometer, 
hymen, hyperbole and other compounds in hyper-, hyphen, hypothesis 
and other compounds in hypo-, lychee, myopic, nylon, psyche and 
almost all its derivatives, pylon, stymie, thylacine, thymus, typhoid, 
typhoon, typhus, xylophone, zygote and derivatives. 

<y> is pronounced /i:/: 
word-finally in most words where it is not the only vowel letter, e.g. 
city, happy and hundreds of others. For exceptions with /az/ see above 
in the prefix poly- when the stress does not fall on the <y>, e.g. 
polyandry (with following /j/-glide), polytechnic, polysyllable, 
polysyllabic (exception: polymer, with /1/). When the stress does fall 
on the <y> it is pronounced /1/, e.g. in polygamy 
medially otherwise in very few words (with following /j/-glide), e.g. 
caryatid, embryo(nic), halcyon. 

The only remaining occurrences of vocalic <y> are all medial and all 

pronounced /1/: 
regular where it precedes at least one consonant letter and either 
of the endings <-ic(al)>, e.g. cryptic, cyclic(al, cynic(al, paralytic, 
pyrrhic, salicylic, typical. \In this set cyclical), typical are exceptions 
(but apparently the only ones) to the rule (see above) that <e>-deletion 
before a suffix beginning with a vowel letter leaves a <y> froma 
previously split digraph pronounced /at1/ 
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regular in words where it is the only vowel letter and there is at least 
one following consonant letter, other than those with <-y.e>, but this 
is a small set: crypt, cyst, gym, gyp, hymn, lymph, lynch, lynx, nymph, 
pyx, sylph, tryst. Only exception: psych 
regular in where it is the last vowel letter and there is at least one 
following consonant letter, other than those with <-y.e>, e.g. abyss, 
acronym, amethyst, aneurysm, antonym, apocalypse, beryl, calyx, 
cataclysm, catalyst, chlorophyll and a few other words ending in 
-phyll, coccyx, (ptero)dactyl, di/triptych, eponym, hieroglyph, hydroxyl 
(second <y>), idyll, larynx, onyx, oryx, pharynx, polyp, sibyl, synonym 
(second <y>). No exceptions 
mostly predictable before two or more consonant letters or <x> where 
ther is at least one later vowel letter, e.g. apocrypha(l), asphyxiate, 
bi/tricycle, cryptic, crystal, cyclamen, cygnet, cymbal, eucalyptus, 
gryphon, gymkhana, gymnast/ ium, gypsum, gypsy, hypnosis, 
hypnotise, metempsychosis, paroxysm, pygmy, rhythm, strychnine, 
syllable, syllabic, syllabub, syllabus, sylloge, symbol, sympathy, 
syndicate, syntax, synthetic, syphilis. Exceptions, all with /at/: cycle, 
cyclone, cypress, forsythia, hybrid, all the words beginning hydr-, 
hygrometer, hyphen, lychee, psyche and almost all its derivatives, 
typhoid, typhoon, typhus 
otherwise in an unpredictable ragbag of words, including acetylene, 
analysis, analytic, chlamydia, cotyledon, cylinder, dithyramb(io), 
eponym(ous), glycerine, hypocrite, metempsychosis, myriad, oxygen, 
paralysis, physics, polymer, porphyria, sybarite, sycamore, sycophant, 
synonym (twice), syringa, syringe, syrup and first <y> in dynasty, 
etymology, hypocrisy, polygamy, porphyry, tyranny. 
The split digraph <y.e> is defined as covering words where the <e> is 
separated from the <y> by one consonant letter other than <r> and the 
<y> is not preceded by a vowel letter and the digraph is pronounced /ar/. 
The definition covers both words where the intervening consonant letter 
is an independent grapheme and words where the <e> is also part of a 
split digraph <ce, ve> - see sections 3.7.6-7, and section 7.1 for dual- 
functioning. The only extension needed is to cover two words with two 
intervening consonant letters forming a digraph: hythe, scythe, and there 
appear to be no exceptions. See also section A.6 in Appendix A. 
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10.41 Correspondences of <a, e, i, 0, u, y> 
(+word-final <e>) in content words with 
no other vowel letters (monosyllables) 


There is more pattern to the correspondences of the vowel letters in 
monosyllabic content words than comes through in the relevant sections of 
this chapter above - see Table 10.2, the inspiration for which | owe to Irina 
Shcherbakova of Moscow. (Most monosyllabic function words are so often 
unstressed that their predominant vowel is /a/). 

| have not included columns for the single vowel letters plus <w, y>, 
because over half the possible combinations do not occur, or for those 
sequences plus final <e>, because such words are rare. For <aw(e), 
ay(e), ew(e), ey(e), ow(e), oy> (<-oye> does not occur), see sections 
10.10/11/21/12/34/35 respectively. 

The comprehensiveness of Table 10.2 conceals the fact that, even where 
a cell does not say ‘(does not occur)’, there may be very few instances. This 
is true of all the cells in the ‘just the vowel letter’ column (see below), and 
of words ending in <-ure>: sure is the only example in its cell, and the only 
companions for cure are lure, pure; brae is also an isolate. 

Table 10.2 makes clear the parallelism in the correspondences of <i, y> 
in relevant words (though <y> is much rarer) - this is why I’ve put <y> next 
to <i>. Also, I’ve put <o, a> first because all the other vowel letter + <r> 
combinations are pronounced /3:/. Two more regularities are: 

Each vowel letter + <e> combination without an intervening consonant 
is pronounced the same as the corresponding split digraph; 
In word-final position <e, i, o> and sometimes <u> are pronounced 
like their letter-names (but <u> is sometimes /u:/, <a> is pronounced 
/a:/ and <y> is pronounced /at/). 
There are only about 19 exceptions to the regular short pronunciations 
before a single consonant letter: raj with /a:/, quad, quag, squat, swab, 
swan, swat, wad, wan, was, what with /b/, chic with /i:/, mic with /a1/, son, 
ton, won with /a/, pud, put, suk with /u/. 

The list of exceptions before geminate and other doubled spellings is 
longer, but still not extensive (the list would be shorter still in accents other 
than RP): chaff, staff, hajj, brass, class, glass, grass, pass, with /a:/; bass 
(‘(player of) large stringed instrument’/‘(singer with) low-pitched voice’) 
with /et/; all, ball, call, fall, gall, hall, pall, small, squall, stall, tall, thrall, 
wall with />:/; retch pronounced /ri:t{/; boll (sometimes), droll, poll (‘head, 
vote’), roll, scroll, stroll, toll with /au/. 


TABLE 10.2: REGULAR CORRESPONDENCES OF <a, e, i, 0, u, y> (+WORD-FINAL <e>) 
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IN MONOSYLLABIC CONTENT WORDS. 


word the vowel the vowel just the the vowel | the vowel the vowel the 
ending letter + letter vowel letter | letter + letter letter + vowel 
in > any single + any <e> + any <re> letter 
consonant geminate consonant + <I> 
letter except | or doubled letter 
<r, W, y> consonant (except <r, 
spelling w, X, Y>) 
+ <e>, 
= split 
digraph 
syllable closed closed open open closed open open 
type > 
vowel short short ‘long’, = ‘long’, = ‘long’, = r-coloured long 
sound > letter name | letter letter diphthong pure 
(except name name or vowel 
vowel <a, y>,and | (except (except 2-phoneme 
letter <u> without | <y>, <y>, sequence 
/j/ glide) and <u> | and <u> (except 
without without <ore>) 
/j/ glide) | /j/-glide) 
<o> /o/ /o/ /au/ /au/ /au/ /o1/ /o:/ 
rod lodge go roe rode fore for 
<a> /xe/ /xe/ /ax/ /et/ /e1/ /ea/ /ax/ 
man catch pa brae name care car 
<y> /1/ (does not /at/ /at/ /at/ /ate/ (does 
gym occur) sty stye Style pyre not 
occur) 
<i> /1/ /1/ /at/ /at/ /at/ /ata/ /31/ 
pin brick pi pie pine fire fir 
<e> /e/ /e/ /ix/ /ix/ /ix/ /1a/ /31/ 
men bell be bee Scene here her 
<u> /A/ /A/ /ux/ /ux/ /ux/ /ua/” /31/ 
without cut fuss flu flue flute sure fur 
/j/ glide 
<u> with | (does not (does not /jux/ /jux/ /jux/ /joa/” (does 
/j/ glide | occur) occur) mu cue cute cure not 
occur) 


* See section 5.6.5 for the increasing replacement of /(j)ua/ by /(j)>:/. 
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A further set arises from taking final <-ve> to be a doubled spelling. Although 
a few preceding single-letter vowels have the regular short pronunciation 
before <-ve>, namely have, give, live (verb, /l1v/) (see sections 3.7.7 and 
9.39), there are 18 words in which the <e> also forms a split digraph with the 
vowel letter, which therefore has a ‘long’ pronunciation (for dual-functioning 
see section 7.1): gave, shave, suave, wave; breve, eve; drive, five, hive, jive, 
live (adjective, /laiv/), swive, wive; cove, drove, move, prove; gyve (see the 
parallel list for polysyllables in the next section), and four words with an 
irregular short pronunciation: dove, glove, love, shove with /a/. 

There are just nine words in the language in which the sole vowel letter 
is followed by word-final <rr>: carr, charr, parr, err, chirr, shirr, whirr, 
burr, purr - but in every case the three letters form a trigraph, and these 
are therefore not really exceptions to the doubled consonant spelling rules 
in Table 10.2. This applies even more strongly to barre, bizarre, parterre, 
myrrh. 

There are only about 54 words with a single word-final vowel letter in 
the language, even when the dictionary is thoroughly scraped (and several 
function words are included); very few are exceptions - see Table 10.3. 


TABLE 10.3: OPEN MONOSYLLABLES WITH A SINGLE VOWEL LETTER. 


Vowel letter Words Pronunciation 
a bra, ma, pa, schwa, spa /ax/ 
e be, he, me, she, the (when stressed), we, ye /ix/ 
i Hi!, the pronoun /, and Greek letter names chi, phi, /at/ 


pi, psi, xi (as pronounced in English) 


mi,ti (the musical terms), ski /ix/ 
oO fro, go, lo, no /au/ 
do, to (when stressed), two, who /ux/ 
u flu, gnu if <gn> is analysed as pronounced /nj/ /ux/ 
gnu if <gn> is analysed as pronounced /n/, and /jux/ 
Greek letter names mu, nu (as pronounced in 
English) 
y by, cry, dry, fly, fry, my, ply, pry, scry, shy, sky, /at/ 


sly, spy, Spry, sty, thy, try, why, wry, plus buy, guy 
(taking <bu, gu> to be digraphs spelling /b, g/) 
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There appear to be only two exceptions for the vowel letter + <e> 
combinations (see sections 10.3/16/23/27/37/40), namely nee with /e1/ 
and shoe with /u:/ - but there are only about 40 such monosyllables in the 
entire language. 

There are very few words which end in a vowel letter + consonant letter 
other than <r, w, x, y> + final <e> (and therefore like look monosyllables 
with split digraphs), but in which the <e> is ‘pronounced’ and the words 
are therefore disyllables and the vowel letter + <e> do not constitute a split 
digraph: blase, cafe, glace, pate (‘paste’), hebe, stele, (bona) fide. (See also 
section 11.4). 

There are also very few words which end in a vowel letter + <re> and 
have an irregular pronunciation: are, ere, there, where, were (all of which 
are function words), and the only two exceptions for vowel letter plus <r> 
are war with />:/ and kir with /19/. 


10.42 Correspondences of <a, e, i, 0, u, y> in 
words with at least one later vowel letter 
other than ‘silent? <e> (polysyllables) 


Only two columns in Table 10.2 can be generalised more or less 
straightforwardly to polysyllables, which can be defined for the purposes 
of this section as all those (huge numbers of) words which do not fit the 
definition of ‘monosyllables’ given in the heading of the previous section. 

First, the single vowel letter graphemes are almost always pronounced 
‘short’ (i.e. as /# e10 A1/ respectively) before geminate and other doubled 
spellings in polysyllables as well as monosyllables - see Table 10.4, which 
is the mirror-image of Table 4.1. 


TABLE 10.4: SHORT AND LONG PRONUNCIATIONS OF SINGLE-LETTER VOWEL 
GRAPHEMES BEFORE SINGLE AND DOUBLE CONSONANT SPELLINGS. 


Before doubled Before other consonant clusters 
consonant spellings and single consonant letters 


Short vowel pronunciation | Regular Both occur, and long/short 


pronunciations are sometimes 


Long vowel/diphthong Very rare predictable but mostly not - see 


pronunciation the rest of this section 
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There are very few exceptions to the rule that single-letter vowel graphemes 
before geminate and other doubled spellings are pronounced short in 
polysyllables. This even applies to various short pronunciations which are 
exceptions to the main one, e.g. words with <a> pronounced /b/. The only 
exceptions I’ve been able to find are camellia, pizza with /i:/, distaff and 
sometimes /atte with /a:/, plimsoll (also spelt plimsole, which would not be 
an exception) with /au/, and thralldom (also spelt thraldom, which would 
not be an exception) with /3:/. The rule extends to consonant letter clusters 
which are or look like trigraphs (even though this not how | would analyse 
them): arrhythmia if pronounced with initial /ez/, butte with /ju:/, chenille, 
pelisse with /i:/, giraffe with /a:/ and ruche, tulle with /ur/. 

The largest (but still tiny) set of exceptions arises from analysing final 
<-ve> as a doubled spelling. Although most preceding single-letter vowel 
graphemes are pronounced short before <-ve> in polysyllables (see sections 
3.7.7 and 9.39), there are 14 words in which the <e> also forms a split 
digraph with the vowel letter, which therefore has a ‘long’ pronunciation (for 
dual-functioning see section 7.1): behave, conclave, forgave; alive, archive, 
arrive, deprive, naive, ogive, recitative, revive, survive; alcove, mangrove. 

The second column in Table 10.2 which generalises reasonably well to 
polysyllables concerns split digraphs. As can be seen in Table 11.3, there 
are only about 30 polysyllabic words in the language in which a word-final 
<e> separated from a preceding single vowel letter by a single consonant 
letter is ‘pronounced’ and therefore constitutes a separate syllable. 

In most polysyllables which end in a vowel letter plus <e> with no 
intervening consonant letter the digraphs are pronounced as in the 
corresponding monosyllables. Thus almost all of those in <-ee> are 
pronounced /i:/, exceptions (all with /er/) being entree, epee, fiancee, 
matinee, melee, negligee, soiree and a few other loanwords from French 
(see section 10.16), all of which are increasingly spelt in English with French 
<ée>. 

Those in <-ie, -ye> are all pronounced with /ar/. Almost all those in 
<-oe> are pronounced with /au/, the only exceptions being canoe, hoopoe 
with /ur/. However, most of those which end in <-ae> are Latinate (largely 
biological) terms with <ae> pronounced /i:/, and only sundae, tenebrae 
appear to have /er1/ like brae. Those in <-ue> fall into two subcategories: 
in most of those with <g, q> preceding <-ue> the three letters form a 
trigraph pronounced /g, k/, the only exceptions being argue with /ju:/ and 
dengue with the <u> forming a digraph with the <g> pronounced /g/, and 
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the <e> being pronounced /e1/ and constituting a separate syllable. All 

other words ending in <-ue> are pronounced with /(j)ur/. 

The rest of this section is an attempt to find other ‘rules’ for the 
pronunciation of the vowel letters as single-letter graphemes in polysyllables. 
The rules below (which should probably be called ‘generalisations’) are 
listed in a logical order which gradually narrows down their scope; in this 
respect the organisation is quite different from that adopted in the sections 
above on the single vowel letters. 

Some preliminaries: 

None of these rules apply to cases of consonantal <i, u, y>. However, | 
recognise that these are sometimes difficult to distinguish from cases 
where they have their vocalic pronunciations, and that some words 
slither between the two; 

None of these rules apply where the vowel letter forms a digraph 
with a following <I r w y>. However, again | recognise that these are 
sometimes difficult to distinguish from cases where the two letters are 
separate graphemes. In particular see <ar, er, ir, or, ur, yr>, sections 
10.7/19/26/31/39/40; 

Where (vocalic) <y> is not mentioned there are either no cases or so 
few that no generalisation about them seems worthwhile; 

In several cases, the pronunciation of <u> has to be given as /(j)u:/ - 
that is, it is either /ju:/ or /u:/ depending on other factors which are 
too complicated to include here - see section 10.36. 

1) The predominant pronunciations of <aeou> as single-letter graphemes 
when in ‘hiatus’, i.e. immediately before another pronounced vowel letter 
belonging to a separate syllable, are /ez i:/ (with following /j/-glide), 
/av (j)ur/ (with following /w/-glide). A few examples are aorta, archaic, 
chaos, chaotic, dais, kaolin, laity, prosaic, azalea, cameo, deity, 
erroneous, meteor, museum, neon, peony, petroleum, spontaneity, boa, 
heroic, poem, poetry, soloist, stoic; actual, annuity, bruin, continuity, 
cruel, cruet, dual, duel, fluid, genuine, gratuity, ruin, suicide, usual. 
There seem to be few or no exceptions. 

2) The predominant pronunciations of <i y> as single-letter graphemes 
when in hiatus appear to be /ar/ when stressed and /i:/ when unstressed 
(all with following /j/-glide), e.g. (stressed) bias, client, dial, giant, 
psychiatry, science, society, triad, triangle, variety, viaduct, violent, 
violet, cryostat, cyanide, dryad, dyad, hyacinth, hyaline; (unstressed) 
alien, battalion, caviar, cheviot, comedian, delirious, dubious, fasciitis, 
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glacier, histrionic, lenient, medium, myriad, odious, odium, polio, 
premier, radii, radium, radius, retaliate, soviet, taxiing, valiant, 
caryatid, embryo, halcyon, polyandry. Exceptions: brio, Shiite, skiing, 
trio with stressed<i> pronounced /i:/; hyena, myopic with unstressed 
<y> pronounced /at/. 

But the problem with both these categories is that some of the two-letter 

sequences involved function much more frequently as digraphs; this is 

particularly true of <ai ea ei eu oa oi ue>. Readers therefore just have to 
learn when these sequences are not digraphs - one bit of help here is that 
the second of two vowel letters in hiatus is never <y>. 

3) The predominant pronunciations of <a i o y> when word-final in 
polysyllabic words and unstressed are /a i: au i:/. The absence of <e> 
here is due to the fact that word-final letter <e> is almost always part 
of a digraph and hardly ever constitutes a separate syllable (for the 
few exceptions see above and sections 10.12 and 11.4). <u> is also 
very rare in these circumstances and is not worth including in the rule. 
And all six vowel letters are so rarely stressed when functioning as 
word-final single-letter graphemes that no rule is worth giving for that 
situation (but see section A.10 in Appendix A). 

All the following rules apply to cases where the vowel letter is followed by 

one or more consonant letters; this condition is stated only the first time. 

4) The predominant pronunciation of <a e i o u> as single-letter 
graphemes when unstressed before a consonant letter(s) is /a/, with 
a tendency for many instances of unstressed <e i> to be pronounced 
/1/ - but this is circular and uninformative; there are few indications in 
the spelling of English words of when a syllable is unstressed (except by 
implication from the few rules which predict where the stressed syllable 
is - see Appendix A, section A.10), of when these graphemes have other 
pronunciations when unstressed (e.g. the first <u> in museum, the first 
<y> in psychiatry), or of when other graphemes are pronounced /2/. 

From here on, all the rules in this section refer to occurrences of the vowel 

letters as single-letter graphemes when stressed, so these conditions are 

stated only the first time. 

5) The predominant pronunciations of <a e i o u y> as single-letter 
graphemes when stressed in the third (antepenultimate) and fourth 
syllables from the end of the word (that is, when the word continues 
(CV)CVCVC(silent <e>), where C can be one or more consonant letters) 
are /e e1vp (jjur1/. This applies to almost all words ending <-ical>, e.g. 
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classical, heretical, political, logical, musical, lyrical, and to many derived 
forms in which a suffix has lengthened a word and produced a change 
from a long to a short vowel sound, e.g. national, profanity, serenity, 
divinity, wilderness, the <e> in egotism. Other examples are acrobat, 
agriculture, animal, antagonism, cameo, caviar, glacier, madrigall, 
sacrament, scarify, valiant, vocabulary, and second <a> in battalion, 
cheviot, decorative, democrat, deprecate, detriment, premier, secretary, 
specify, citizen, delirious, military, misery, crocodile, monument, 
oxygen, profligate and soviet if pronounced with /v/ (if pronounced 
with /au/ it is an exception which instead obeys rule 10); crucifix, 
cucumber, dubious, fugitive, funeral, impunity, lubricant, lucrative, 
ludicrous, mutilate, mutiny, nuclear, pugilist, scrupulous, scrutiny; 
cyclamen, myriad, polygamy, porphyria, syllable, syllabub, syllabus, 
sylloge, typical, tyranny. For a major class of exceptions see rule 10. 
Other exceptions: agency, favourite, decency, obesity, penalise, bribery, 
library, microscope, nitrogen, rivalry, motorist, notify, soloist, culinary, 
gluttony, jugular, truculent, hydrogen. Some of these exceptions are 
derived forms retaining a letter-name vowel from the stem word. 

A corollary here is that so few words are long enough to have syllables 

before the fourth from last that no rules are worth giving for these ‘early 

syllables’. 

6) The predominant pronunciations of <a e io u> before two different 
consonant letters followed by word-final <-le, -re> in words with no 
earlier vowel letters are /e e 10 A/, e.g. angle, handle, tremble, uncle, 
muscle; centre, sceptre, spectre, lustre and most of the <-stle> group 
(except castle) - see section 3.7.6. 

7) The predominant pronunciations of <aeiouy> beforea consonant letter 
other than <I, r> followed by <I, r> where there is a later pronounced 
vowel letter (including the <e> of the 2-phoneme grapheme <-le>) 
are /eI ix az au (j)ur ar/. A few examples are able, cradle, maple; bible, 
disciple, idle, title, trifle; noble; bugle, duplex, scruple; cycle, cyclone; 
acre, April, apron, flagrant, fragrant, sabre, fibre, mitre; cobra, 
ogre; lucre, putrid; cypress, hybrid. Extension: ochre, where the first 
intervening consonant is represented by a digraph. Some exceptions: 
establish, treble, triple, goblet, goblin, problem, publish, acrid, Avril, 
petrol, citr-ic/on/ous, copra. 

8) The predominant pronunciations of <aei u> when followed by a single 
consonant letter and word-final <-ate, -et, -it, -ite, -ot, -ut, -ute> are 
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9) 


10) 


11) 


/w# e1 (jjur/. A few examples are gamut, granite, planet, tacit; legate, 
senate; rivet, limit, bigot, minute; unit. | can find no examples with <o>. 
Some exceptions: climate, pilot, private. 
The predominant pronunciations of <aeiou> when followed by a single 
consonant letter and word-final <-ic, -id, -it, -ule> are /e e 1 (j)ur/. 
A few examples are acid, rabid, squalid, tepid, frigid, timid, solid, stolid, 
cubic, humid, lucid, music, punic, putrid, runic, stupid, tunic. Among the 
few exceptions are acetic, fetid pronounced /'fi:ttd/ (also pronounced 
with /e/), graphemic, phonemic, scenic, chromic, and phobic and all its 
compounds. 
The predominant pronunciations of <a e i o u> before a single 
consonant letter (except <r>) and an ending containing any of <ea 
eo eou eu ia ie io iou iu> are /er ix 1 au (j)ur/, regardless of whether 
the ending contains two syllables or one. There are thousands of 
examples; a few are: (2-syllable ending, stress on antepenultimate - 
these words are exceptions to rule 5) azalea, alien, radium; meteor, 
comedian, lenient, medium, erroneous, petroleum, polio, odious, 
odium, dubious; (1-syllable ending, stress on penultimate) courageous, 
facial, nation, spacious, cohesion, specious, delicious, magician; ocean, 
social, quotient; crucial, solution. Exceptions: companion, pageant, 
ration, spaniel with /z/; discretion, precious, special with /e/; soviet if 
pronounced with /v/; bunion, onion with /A/. 
The predominant pronunciations of <a ei o u> as stressed single- 
letter graphemes before a single consonant letter and the endings 
<-al, -sive> are /ez ix az au (j)ur/, e.g. fatal, naval, legal, regal, venal; 
arrival, final, reprisal, rival, spinal. local, modal, opal, oval, proposal, 
total, vocal, brutal, ducal, frugal, refusal, tribunal, evasive, adhesive, 
decisive, corrosive, explosive, abusive, conclusive, intrusive. Exceptions: 
medal, metal, pedal, petal with /e/. Vowel letters preceding the ending 
<-ssive>, however, are ‘short’. 


Beyond this point, any further rules would apply to so few words that 


they are hardly worth stating, and, lamentably, there are large numbers 


of words which are not covered. The two largest gaps are probably (1) 


reduced pronunciations (/a, 1/) of the single vowel letters when unstressed; 


(2) long and short pronunciations of the single vowel letters before single 


consonant letters in circumstances other than those covered above. These 


are the places where the pronunciation of single vowel letters is at its most 


unpredictable from the spelling in English and requires most effort to learn, 
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and any attempt to show further regularities would be too complex to be 
useful because of the large numbers of exceptions. 

The elephant in the room in this section is: how can you tell from the 
written forms of English words where the main stress falls, in order to work 
out where some of my ‘rules’ apply? For some discussion of this see section 
A.10 in Appendix A. 

Inspection of the headings of sections 10.3-40 will show that rather more 
than might be expected (<air are aw ee e.e eer igh ir 0.e oj ore oy>) give the 
percentage of the basic pronunciation as 100%, and two others (<ie i.e>) 
are close to that. But many of the rest are somewhat or considerably lower, 
and in three cases (<ere i y>) no useful figures can be given. Overall (but | 
have not (yet) done the calculation) | would guess that the predictability of 
the pronunciations of main-system graphemes beginning with vowel letters 
may be about 60%. 


10.43 Consolation prize? 


The only consolation prize is that almost all multi-letter graphemes beginning 
with vowel letters have far fewer correspondences and more regularity than 
the vowels letters as single-letter graphemes have - yet even here there is an 
egregious exception: <ou>, with 8 minor correspondences. As with various 
other aspects of the system, there is no choice but to learn the rest. 


11. Evaluating some 
pronunciation rules for 
vowel graphemes 


In this chapter | assess the reliability or otherwise of just five rules which 
purport to help children and others taking their first steps in reading to 
generate accurate pronunciations of vowel graphemes. For some rules 
covering the VC(C) part of CVC(C) monosyllables which could well be useful 
at a slightly later stage, see section A.7 in Appendix A. 


11.1 Some history 


There is a long tradition of teachers looking for rules for pronouncing 
vowel graphemes, and almost as long a tradition of finding most of them 
unhelpful. For example, McLeod (1961, cited in Carney, 1994: 70-74) 
reported ‘the result of a survey to which 76 teachers in 28 Scottish schools 
contributed’. From 59 rules submitted McLeod set 32 aside ‘since they 
merely grouped words according to common suffixes’. Of the other 27 
only five are reading (grapheme-phoneme) rules, and only three of those 
concern vowel graphemes - they correspond to sections 11.2, 11.4 and 
11.5 below. (The other two reading rules found by McLeod and discussed 
by Carney concern consonant graphemes, namely <wr> pronounced /r/ 
(see section 9.40) and <ch> allegedly pronounced /Jf/ after <n>, which | 
have ignored. Except for a couple which cover very few words, all McLeod’s 
spelling (pbhoneme-grapheme) rules are covered, without this being made 
explicit, in chapters 4 and 6). 
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The most famous article in this tradition is Clymer (1963, reprinted 
1996). Of the 45 rules he discussed, five deal with syllabification, which 
is not relevant to this book, and six with word stress - see section A.10 
in Appendix A. The other 34 rules all deal with grapheme-phoneme 
correspondences, 10 with consonant graphemes, 23 with vowel graphemes, 
and one with a mixture of the two. Many are trivial, or special cases of more 
general rules; when all of that and duplications are sorted out, the rules for 
vowel graphemes reduce to the five discussed in sections 11.2-6, of which 
four are useful and one (the best known) is not. 

Johnston (2001) listed several replications of Clymer’s study between 
1967 and 1978; most arrived at similar conclusions. However, Gates (1983, 
1986) re-formulated some generalisations to make them more reliable (as 
| have in some cases below), and Burmeister (1968) focused specifically on 
the best-known rule - see section 11.2. Johnston (2001) herself re-visited 
several of Clymer’s rules for vowel graphemes without, in my opinion, 
adding anything of value. 


11.2 ‘When there are two vowels side by side, 
the long sound of the first one is heard 
and the second is usually silent.’ 


Often popularly stated as: ‘When two vowels go walking, the first does 
the talking.’ 

This rule has long been popular in North America, despite having been 
blown to pieces by Clymer (1963/1996). It was meant to tell children which 
of two adjacent vowel letters indicates the pronunciation of a digraph, but 
it is unclear, or underspecified, in seven respects: 

It does not say (presumably assumes teachers and children know) 
which letters are ‘vowels’, but it seems clear that <a, e, i, 0, u> are 
the intended vowel letters; 

It ignores the consonantal pronunciations of <i, u> when they precede 
other vowel letters, as in onion, language (see sections 10.22 and 
10.36), presumably because these are not relevant to initial instruction; 
It does not say (presumably assumes teachers and children know) 
what the ‘long sounds’ (or ‘talkings’) of these vowel letters are, but 
again it seems clear that the ‘letter-name’ sounds /eI, it, aI, au, jur/ 
are meant; 
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It is not clear why it says the second vowel letter is ‘usually’ silent - 
perhaps to allow for words like dais, zoology with two vowel letters 
which normally form a digraph but in particular words do not; 
It does not say whether sequences of two identical adjacent letters are 
to count as digraphs for this purpose, but | think the rule is meant to 
apply only to sequences of two different letters, so in what follows 
| have not looked at <aa, ee, ii (which never occurs as a digraph 
anyway), 00, uu (which only occurs in muumuu, vacuum)>, except for 
word-final <ee>; 
It doesn’t say whether <w, y> are to count as vowel letters for this 
purpose. In her re-evaluation of the rule Johnston (2001) decided to 
include <aw, ew, ow, ay, ey, oy> (<iw, uw, iy, uy> never occur as 
digraphs), so | have followed this; 
It takes no account of <ye>, the only vowel digraph with <y> as first 
letter, but since this occurs in only seven words, it can be ignored. 
There are two other possible sequences that never occur as digraphs: <iu, 
uo>. Assuming the 12 exclusions just mentioned (<aa, ee, ii, iu, iw, iy, 00, 
uo, uu, UW, Uy, ye>), there are 23 relevant vowel digraphs consisting of 
adjacent vowel letters or a vowel letter plus <w, y>. 

There is one set of words for which this rule holds true with few 
exceptions, namely monosyllables ending in <ae, ee, ie, oe, ue>, almost 
all of which (see Table 10.3) are pronounced with the letter-name sounds 
/eI, it, aI, av, jur/. Unfortunately (as Table 10.3 also shows), the total 
number of relevant words in the entire language is about 54. 

Within the set of 23 relevant vowel digraphs, 12 belong to the main 
system and 11 are Oddities; they are all shown in Table 11.1 with their 
predominant pronunciations (except for <ae, ie, oe, ue> in word-final 
position in monosyllables), and relevant percentages of occurrence of those 
pronunciations derived or deduced from chapter 10. 

From Table 11.1 is it clear that the rule only works for <ay, ea> and 
possibly <ai, ue> among main-system digraphs, plus four or five of the 
Oddities, a very poor result. 

Because so few digraphs actually conform to the rule Burmeister (1968) 
advocated teaching them in groups, of which those which do conform 
would be one - but her other groups were entirely artificial because they 
supported no generalisations at all, and therefore failed to set digraphs 
which conform to the rule sufficiently apart. 

Verdict: This rule should be consigned to oblivion, and digraphs should 
be taught individually. 
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TABLE 11.1: ‘WHEN TWO VOWELS GO WALKING, THE FIRST DOES THE TALKING’ 


Digraph | Predominant pronunciation(s) | Conforms to the rule? 

ai /et/ 43% No, unless said is 
/e/ 46% excluded 

au />:/ 46% No 
/0/ 43% 

aw />:/ 100% No 

ay /ert/ 100% Yes 

ea /ix/ 73% Yes 

ew /jux/ 84% No 

Main system | jo * /ix/ 73% No 

oi />1/ 100% No 

Ou /au/ 48% No 
/au/ 1% 

ow /au/ 45% No 
/au/ 44% 

oy />1/ 100% No 

ue” /jux/ 59% ? 
/ux/ 41% 

ae” /ix/ 62% No 

ao /ert/ 69% Yes 

ei /ix/ 69% Yes 

eo /a/ 70% No 
/ix/ only in people oT 

eu /ux/ 58% No 

ey /ix/ 76% Yes 

Oddities 

ia /a/ 57% No 
/at/ only in diamond 

io /a/ 100% No 

oa /au/ 96% Yes 

oe” /ix/ 65% ? 

ua /a/ 100% No 

ui /ux/ 73% No 


* For monosyllables ending in these digraphs, the rule is largely true. 


and two other very rare words - see section 10.12. 
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11.3 ‘When a written word has only one vowel 
letter, and that letter is followed by at least 
one consonant letter other than <r, w, y>, 
the vowel has its usual short pronunciation.’ 


A better-known version of the rule is ‘When a word has only one vowel and 
that vowel is in the middle, it is usually short’, but my formulation (above) 
is more accurate, partly because <a, 0, u> have alternative pronunciations. 
Even though <r, w, y> in these circumstances after a vowel letter always 
form a vowel digraph with the vowel letter, for teaching purposes it would 
clearly be better to treat them here as consonant letters. The rule applies 
mainly or entirely to closed monosyllables, and regardless of the number of 
consonant letters following the vowel letter. 

(English is rich in monosyllables - many years ago three American 
nerds compiled a list of 9,123 (Moser et al., 1957), and there was once 
a competition to find or devise the longest one (Gardner, 1979), defined 
in terms of letters rather than phonemes; the competition was won by an 
American poet named William Harman, with broughammed (‘travelled by 
brougham’, which can be pronounced /bru:md/ in General American but 
would have two syllables /'bru:wamd/ in RP). 

Most monosyllabic words in English are phonologically closed (end in a 
consonant phoneme(s)). Table 11.2 sets out the facts on my version of the 
rule, at least as far as the RP accent is concerned (it seems clear that <o> 
has no ‘short’ pronunciation at all in the General American accent - see 
Cruttenden (2014: 127) and Carney (1994: 59)). 


TABLE 11.2: PRONUNCIATIONS OF VOWEL LETTERS IN WORDS WITH A SINGLE, 
NON-FINAL VOWEL LETTER FOLLOWED BY AT LEAST ONE CONSONANT LETTER 
OTHER THAN <r, w, y>. 


Vowel Principal short | Other short Long pronunciations 

letter pronunciation pronunciations 

a /e/ /o/ in 25 words, /D:/ in 26 words, e.g. ball, salt, 
e.g. squash, was talk; 


/ax/ in 18 words, e.g. calm, half, 
/e1/ only in bass (the musical term) 


e /e/ - /it/ only in retch pronounced /ri:tf/ 
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TABLE 11.2: PRONUNCIATIONS OF VOWEL LETTERS IN WORDS WITH A SINGLE, 
NON-FINAL VOWEL LETTER FOLLOWED BY AT LEAST ONE CONSONANT LETTER 
OTHER THAN <r, w, y>, CONT. 


Vowel Principal short | Other short Long pronunciations 
letter pronunciation pronunciations 
i /1/ - /at/ in 18 words, e.g. child, find, 
pint, sign; 
/ix/ only in chic 
fe) /o/ /A/ in 8 words, e.g. | /au/ in 34 words, e.g. both, colt, 
SON; comb, don’t, gross, post, roll, told; 
/v/ only in wolf /ux/ only in tomb, whom, womb 
u /A/ /u/ in 14 words, 7 


e.g. bull, push 


y /1/ = = 


Thus the total number of exceptions, even counting both categories, is no 
more than 150, some of which beginner readers are unlikely to encounter, 
and there are undoubtedly thousands of words which obey the rule. | 


therefore consider it to have high reliability, probably over 90%, and well 
worth teaching. 


11.4 ‘When a final <e> is preceded by a 
consonant letter other than <r, w, x, y> and 
that consonant is preceded by a single vowel 
letter, the final <e> is silent and the other 
vowel letter has its letter-name (‘long’) sound.’ 

This is my attempt to formulate a rule for ‘magic <e>’/split digraphs that is 

more accurate than some current formulations, e.g. 

+ ‘The final <e> in a word is not pronounced’. 


- ‘<e> at the end of a word makes the preceding vowel in the word long’. 
Table 11.3 shows the relevant data. 
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TABLE 11.3: RELIABILITY OF RULES FOR SPLIT DIGRAPHS OR ‘MAGIC <e>’ 
WHERE THE INTERVENING LETTER IS NOT <r, w, x, y>. 


Split Predominant Alternative Major exceptions Words with 
digraph | pronunciation | long ‘pronounced’ final 
pronunciation <e> 
a.e /et/ 68% /ax/ 32%, Lots of words agape (‘love feast’), 
e.g. mirage with <-age, -ate> agave, biennale, 
pronounced blase, cafe, canape, 
/1d3, at/, e.g. curare, finale, glace, 
village, accurate kamikaze, karate, 
macrame, pate 
(‘paste’), sesame, 
tamale 
e.e /ix/ 100% - - hebe, machete, meze, 
naivete, protege, 
Stele, ukulele 
i.e /at/ 97% /ix/ 3%, bodice, give, live aborigine, anime, 
e.g. police (verb), lots of facsimile, (bona) fide, 
longer words with recipe, simile 
<-ive> pronounced 
/Iv/, @.g. massive; 
various words 
with <-ine, -ite> 
pronounced 
/In, It/, e.g. 
examine, definite 
0.e /au/ 95% - compote, gone, abalone, adobe, 
scone, shone with anemone, coyote, 
/0/, above, become, | epitome, extempore, 
come, done, dove, expose (‘report of 
glove, love, none, scandal’), furore 
Shove, some with pronounced 
/A/, welcome /fjua'roirer/ (also 
and adjectives in pronounced 
<-some> with /2a/ /'fjuars:/), 
guacamole, 
hyperbole, sylloge 
u.e /jux/ 89% /ux/ 11%, - resume (‘c.v.’) 


e.g. rude 


/at/ 100% 
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All the rules for split digraphs are predicated on the word-final <e> being 
‘silent’, so the first necessity is to exclude polysyllables in which it is 
‘pronounced’. Table 11.3 shows that there are only about 39 words in the 
language in which a final <e> separated from a single preceding vowel letter 
by one consonant letter is ‘pronounced’, and three, curare, extempore and 
furore pronounced /fjua'rairer/, have the banned letter <r> intervening. Of 
the 39 words, only cafe is at all frequent. 

The percentages shown were calculated without taking words with 
‘pronounced’ final <e> or the major exception categories into account (but 
most of the words in those categories would again not feature in beginner 
readers’ books), and this would reduce the strength of the main rules for 
<a.e, i.e, 0.e>, but on the whole the ‘magic <e>’ rules hold good and are 
worth teaching. Most learners will, | think, acquire the /u:/ pronunciation 
of <u.e> without even noticing that they have, or that /u:/ is different 
from /jux/, and also learn without noticing it that <y.e> has the same 
pronunciation as <i.e> (most words with <y.e> are rare, so this digraph 
should present no problem for reading when encountered). 

As it happens, the inclusion of <x> among the letters banned from the 
mid position in this rule excludes just three words in the entire language: 
annexe, axe, deluxe, so the rule could well be taught without <x>, and 
would then be parallel to the ‘short vowel’ rule in section 11.3. If consonant 
digraphs were admitted to the dot position for this analysis, other rare 
words would join the list with ‘pronounced’ final <e>, e.g. antistrophe, 
oche, strophe, synecdoche. 

For more on split digraphs and their definition, see section A.6 in 
Appendix A. 


11.5 ‘When <a> follows <qu, w, wh> 
and is not followed by <r>, or by 
any consonant letter plus <e>, it is 
pronounced /pD/.’ 
This rule is usually stated without the clause ‘and is not followed by <r>, or 
by any consonant letter plus <e>’, but this is essential to rule out the <ar> 
digraph and cases where ‘magic <e>’ would override (e.g. quake, wade, 


whale), and my version is therefore more accurate. There are 21 relevant 
words with <qua>, 42 with <wa>, and only what with <wha>. Of the 64 
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words, the only exceptions are walk, wall, water, all pronounced with /5:/, 
so this rule is highly reliable (95% if each word is given equal weighting). 
There are also seven words in which <a> is followed by <r(r)> but those 
letters do not form a di/trigraph and the <a> is pronounced /v/: quarantine, 
quarrel, quarry, warrant, warren, warrigal, warrior, but these need to be 
taught separately because in the great majority of words in which <ar> 
follows <qu, w, wh> it is pronounced either /3:/ (e.g. quart, ward, wharf) 


or /a/ (e.g. steward, towards). 


11.6 ‘When <y> is the final letter in a word, 
it always has a vowel sound, either alone 
or in combination with a preceding 
<a,e,o>.’ 

Given that word-final <y> is never a consonant letter, this rule is 100% 

reliable. Formulated like this, it may seem entirely obvious to proficient 


readers, but may be helpful to learners. <ay, ey, oy> are also covered in 


section 11.2. 


Appendix A: Assumptions 
and technicalities 


A.1 Citation forms 


This book is almost entirely concerned with the citation forms of words, that 
is, how they would be pronounced by people with RP accents who were asked 
to read them aloud from a list, and/or how the words’ pronunciations are 
transcribed in broad IPA in the Cambridge English Pronouncing Dictionary. 
However, quite a few words have what Carney (1994) calls ‘allegro’ and 
‘lento’ pronunciations, that is in more rapid and less rapid speech, and both 
may well feature in their citation forms if a sufficient sample of people is 
polled. | cover a few variants of this sort (see especially section 6.10), but it 
would be impossible to cover all of them. 


A.2 Phonemes 


Phoneticians disagree profoundly about the acoustic existence of phonemes. 
However, for the purposes of analysing any alphabetic spelling system 
it seems to me that assuming the psychological reality of phonemes is 
inescapable, and | have proceeded on that assumption. One justification 
might be that there are no possible correlations between parts of letters and 
aspects of the acoustic signal. Another might be that otherwise it is difficult 
to imagine how alphabets came to be invented in the first place. 

| have also assumed that long vowels take longer to pronounce than 
short vowels, even though the acoustic evidence shows this to be only partly 
true, if at all. 
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If phonemes are assumed to have some reality, how are they to be 
defined? The most basic definition is the one | offer in chapter 1: ‘distinctive 
speech sounds’, that is, differences in sound which make a difference to the 
meanings of words. Thus in English /b, p/ are phonemes because the words 
bad, pad (and many others) which differ in meaning differ in speech only in 
this respect. But a fuller definition would make it clear that phonemes exist 
in a dynamic system with others within (a particular variety of) a particular 
language. 

So distinctions which are phonemic in English may not be in some other 
languages (e.g. /I, r/ are not separate phonemes in Japanese or Kikuyu), 
while distinctions that are not phonemic in English may be so elsewhere. 
For example, unaspirated and aspirated /k, k", p, p”, t, t"/ are not phonemic 
in English (and are therefore difficult for monolingual speakers of English 
to tell apart without training) because the unaspirated versions occur only 
after /s/ (try holding a hand in front of your mouth and saying pin, spin 
and notice the puff after the /p/ in pin and its absence in spin) - but are 
phonemic in many languages of the Indian sub-continent (and are thought 
to have been so in classical Greek, where the six phonemes were written 
K, X, TT, &, T, 8 respectively (in modern Greek x, , 8 represent /x (as in 
Scots loch), f, 8/ respectively - and this shows where the values of two IPA 
symbols have come from). 


A.3 Syllables 


Though difficult to define rigorously, syllables are intuitively obvious - 
psycholinguists showed many decades ago that children can be taught very 
quickly how to count (or indicate by moving the right number of pebbles or 
other symbols) the syllables in words spoken to them by an experimenter - 
and that phonemes are much less intuitive and more difficult to count. 

In strict linguistic terms, therefore, as just implied, only spoken words 
have syllables, and written words do not - but ordinary usage can be very 
confusing here since dividing and hyphenating words at line ends in print is 
called ‘syllabification’ (in Britain; ‘syllabication’ in North America). The word 
extra can be used to show the difference. If this word ever needed to be split 
between lines it would presumably appear as: 


ex- 


tra, 
but its spoken syllables are /'ek - stra/, with the two phonemes represented 
by the <x> belonging to separate syllables. 
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There are two reasons for insisting that only spoken words have syllables. 
First, imagining that written words have syllables further confounds already 
confused attempts to predict word stress from the written forms of words 
(see section A.10 below). 

Secondly, even if it was thought useful to try to define syllables within 
written words, this would very quickly lead to problems. As the example of 
extra just demonstrated, it is often difficult, sometimes impossible, to say 
where the boundaries between written syllables are. 

However, all of this poses a problem for the grapheme-phoneme sections 
of this book, chapters 9 and 10 - especially chapter 10 - because it is 
sometimes necessary even So to refer to syllables, and therefore (explicitly 
or by implication) to the spoken forms of words, including resorting to 
circumlocutions such as ‘the syllable containing /s/ spelt <ce>’. Where 
this pinches most is in sections 10.41-42, where | attempt to give general 
rules for the pronunciation of the vowel letters as single-letter graphemes 
in monosyllables and polysyllables respectively, and in section A.10 below, 
where | summarise the difficulties involved in trying to predict where the 
stresses fall on English words, given only their written forms. For my attempt 
to get round some of this with clear definitions see the heading of section 
10.41 and the first paragraph of section 10.42. 


A.4 Graphemes 


The phoneme inventories of languages are mainly established by finding 
‘minimal pairs’, spoken words which differ in only one sound segment but 
have different meanings; see again bad/pad above, and for another example 
(the few pairs of English words differing only in /@/ v. /6/ such as wreath/ 
wreathe) see section 9.36. Some linguists try to establish the graphemes of 
an orthography similarly, that is by classifying all the letter shapes which 
differentiate written words with different meanings. For English, this would 
result in an inventory of about 50 graphemes - the upper- and lower-case 
versions of the 26 letters of the alphabet, plus 2 for the variant forms of 
lower-case <a, a> and <g, g> (unless those were called ‘allographs’ by 
analogy with the allophone variants of phonemes), possibly minus a few 
for letters with graphically similar upper- and lower-case forms <C, c; K, 
k; O, 0; P, p; S, s; U, u; V, v; W, w; X, x; Y, y; Z, z>, possibly plus some for 
‘ligatures’ (joined letters) which used to be used in print (e.g. <a> in words 
like cegis, Caesar), and possibly plus a few for common abbreviations and 
punctuation marks <&!,.@?:; ....> - but where would you stop? For 
example, are numerals graphemes? Also, this approach would signally fail 


460 Dictionary of the British English Spelling System 


to uncover any multi-letter graphemes, and by extension the feature | have 
labelled ‘dual-functioning’, both of which seem to me absolutely necessary 
in analysing English spelling. 

| have instead taken the (to me) more common-sense approach of asking 
which letters and letter-combinations represent which phonemes (chapters 
3 and 5), and then using the inventory of graphemes so established (chapter 
8) to work back to phonemes (chapters 9 and 10). 


A.5 Every letter belongs to a grapheme 
(almost) 


It is commonly believed that English spelling has lots of ‘silent letters’, 
‘magic <e>’ being the classic example, along with the first letter in word- 
initial clusters such as <kn, wr>. Well yes, but every alphabetic script is 
composed entirely of silent letters, if you think about it. What is meant is 
letters which might as well not be there, since the spelling would represent 
the same word-sound without them, e.g. 


write which could be rite (and is, in another meaning) 
honest whichcould be ‘onest 
friend whichcouldbe “frend 


beauty whichcouldbe “buty 


or letters which at their position in the written form of a word do not 
correspond to a phoneme at that position in the spoken form but may 
nevertheless affect its pronunciation, e.g. ‘magic <e>’. 

In my view the identification of silent letters is more a matter for spelling 
reformers than for teachers. Learners have to learn the current spellings, 
and have to develop ways of remembering non-obvious parts of the system 

- of which there are many more besides ‘silent letters’ (e.g. whether medial 

and linking /w/ and /j/ are represented in the spelling or not - see sections 
3.8.7-8 and 9.0). It may be more helpful to learners to be asked ‘How do 
we write /r/ at the beginning of writing?’ and told ‘<wr> at the beginning 
of a word is pronounced /r/’ than to be told ‘The <w> at the beginning of 
writing is silent.’ 

In accordance with this ideal have adopted a ‘principle of exhaustiveness’ 
(first proposed by Albrow, 1972, and adopted by Carney, 1994): that is, as 
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far as possible every letter in a word’s spelling should be allocated to one of 
the phonemes in its spoken form. So you will find that | have analysed <wr> 
as one of the spellings of /r/, <ho> as one of the spellings of /p/, <ie> as 
one of the spellings of /e/, <eau> as one of the spellings of the 2-phoneme 
sequence /ju:/, <ps> as one of the spellings of /s/ (as in psychology), etc., 
etc. 

On the whole, this works well. However, in section 6.10 you will find a 
whole set of elided vowels, cases where a vowel letter never corresponds to 
a phoneme, even in citation forms (or only in very conservative or artificial 
‘spelling guidance’ pronunciations), and where | consider it would be over 
the top to add sequences consisting of those vowel letters and the preceding 
consonant letters to the inventory of graphemes. This does lead to a fuzzy 
boundary on the category of graphemes, but it seems to me that complete 
consistency is unattainable here. 

The impossibility of completeness is particularly visible in the case of 
odd spellings of place- and personal names. For example, most of the 
letters in Leicester, Worcester can be assigned to phonemes in their spoken 
forms /‘lesta, ‘wusta/ (as can all the letters in the alternative spellings 
Lester, Wooster), except <ce>: if the principle of exhaustiveness is to be 
maintained, these letters would, | think, have to be combined with the <s> 
as a new grapheme corresponding to /s/ - but there is no warrant for a 
grapheme <ces> in the main vocabulary, so | have not added it to the 
inventory or used Leicester, Worcester as examples of /e/ spelt <ei> or 
/u/ spelt <or> respectively, even though these are in the inventory. And if 
you believe in Cholmondeley-Featherstonhaugh as a genuine spelling of a 
double-barrelled surname pronounced /'‘t{amli:'feenfa:/ you’d have to add 
/A/ spelt <ol>, and puzzle over how to divide <onde> between /m, |/ with 
no principled way of allocating any of the letters to either phoneme. Worst 
of all, between the initial /f/ and final /3:/ of /'feanfa:/ | can see no way 
of getting the <s, n> to be part of the spellings of /n, f/, which are in the 
opposite order to the letters. 


A.6 Split digraphs 


The classic case of a so-called ‘silent letter’ which does influence the 
pronunciations of words and cannot be removed without altering them 
is the ‘magic’ <e> in split digraphs - but how should split digraphs be 
defined? A first and superficially appealing definition would be ‘Cases of 
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<a, e, i, o, u, y> followed by a consonant letter and stem-final <e> where 

the <e> indicates the letter-name (‘long’) pronunciation of the vowel letter’. 

This will not work, because: 

1) the pronunciation of <y.e> is /at/, which is the name of <i> (and not 
/wat/, the name of <y>) 

2) <u.e> is pronounced not only as <jur> but also as <ur> 

3) all the other split digraphs except <y.e> also have pronunciations other 
than the ‘letter-name’ one. 

For my separate definitions of the six split digraphs | recognise see sections 

10.4/17/24/28/38/40. Herel attempt to generalise them inthis formulation: 


A split digraph consists of stem-final <e> preceded by (usually) a 
single consonant letter (other than <h, j, gq, r, w, x, y>) preceded by 
one of <a, e, i, 0, u, y> where that letter is not preceded by a vowel 
letter and where the digraph is pronounced either as the name of the 
first letter of the digraph or as another long vowel or diphthong. 
The last phrase covers the pronunciation of <y.e> and the /u:/ pronunciation 
of <u.e> as well as non-letter-name pronunciations of the other split 
digraphs. 

The exclusion of <h, j, q> from occupying the ‘dot’ position in a split 
digraph is mentioned solely for completeness (and only here, and not in 
any of the relevant sections of chapter 10): there are not, and cannot be, 
any such sequences as <ahe, eje, iqe>, etc., in stem-final position. The 
exclusion of <r, w, x, y> from occupying the ‘dot’ position keeps out <are, 
ere, ire, ore, ure, yre; awe, ewe, owe; aye, eye> which need to be analysed as 
trigraphs to account for their correspondences, and <axe, exe> which need 
to be analysed as having <xe> as a digraph separate from the preceding 
vowel letter. (Other combinations, e.g. <iwe, oxe, uye>, do not occur). 

Letter <g> as sole occupant of the ‘dot’ position is odd. There are no 
words ending <yge>, and very few ending <ege, ige, oge, uge> - see 
sections 10.17/24/28/38 - but there are hundreds ending <age>, many of 
which have neither of the split digraph pronunciations - see below, and see 
the entry for <a.e>, section 10.4, for the three competing pronunciations. 

The restriction to one intervening consonant letter has to be relaxed to 
allow for words where there is clearly a split digraph according to the rest 
of the definition but there are two consonant letters or <gu, qu> forming 
a consonant digraph intervening. This extension covers varying numbers 
of words under <a.e, i.e, 0.e, u.e, y.e> (and none under <e.e>), totalling 
about 64 in all. The full list of consonant digraphs which can occupy the dot 
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position is <ch, gn, gu, Il, mb, qu, ss, th, tt>, and most words containing 
them are unusual. There are also just 13 stem words with <n, g> or <s, 
t> spelling separate phonemes intervening in <a.e> pronounced /e1/ 
which seem to me to fit the definition and which | have decided to include: 
arrange, change, grange, mange, range, strange (plus the derived forms 
estrange, exchange); baste, chaste, haste, lambaste, paste, taste, waste, 
plus four oddities: caste with <a.e> pronounced /a:/ surrounding <st>, 
and three words with <squ> occupying the dot position: bisque, odalisque 
with surrounding <i.e> pronounced /i:/, and brusque pronounced /bru:sk/ 
(also pronounced /brask/, which requires an analysis not involving a split 
digraph) with surrounding <u.e> pronounced /u:/. The dot position cannot 
be occupied by any other multi-letter sequence, in my analysis. 

The stipulation that the leading letter in a split digraph must not be 
preceded by a vowel letter is needed to rule out vowel digraphs, etc., 
which do not need the final <e> to indicate their pronunciations. This 
differentiates my analysis from that of Mountford (1998), who recognises 
the following 12 ‘split trigraphs’ with two vowel letters preceding the dot: 
<ai.e, au.e, ea.e, ee.e, ei.e, eu.e, ia.e, ie.e, Oi.e, 00.e, OU.e, Ui.e>, plus 10 
more ‘split trigraphs’ with a consonant letter (counting not only <I. r> but 
also <w, y> as consonant letters) immediately preceding the dot: <al.e, 
ar.e, aw.e, er.e, ir.e, is.e, or.e, OW.e, Oy.e, ur.e>, and even the following five 
‘split four-letter graphemes’, all with two vowel letters and then a consonant 
letter preceding the dot: <ais.e, ear.e, ier.e, oar.e, our.e>. Some of these 
extended split graphemes were posited as far back as Cordts (1965). | have 
found none of them necessary in my analysis because all such cases yield 
instead to analyses with the letters before the dot forming graphemes in 
their own right, and the final <e> forming a di/trigraph with the preceding 
consonant letter(s). 

The letter-name pronunciations of <a.e, e.e, i.e, 0.€, Uu.e> as 
/@I, it, aI, au, jur/, plus <u.e, y.e> as /ur, ar/, then represent the obvious 
pronunciations of the split digraphs. 

However, under <a.e, e.e, i.e, 0.e> | have added /a:, et, i, u:/ 
respectively as alternative pronunciations to cover large numbers of words 
with <a.e> pronounced /a:/, just four stem words with <e.e> pronounced 
/er/, a moderately large set of words with <i.e> pronounced /i:/, and just 
six with <o.e> pronounced /u:/. With these extensions, five of the six split 
digraphs have two pronunciations each, the exception being <y.e>, which 
is only pronounced /atr/. It seems to me that these extra correspondences 
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of the split digraphs need to be analysed in this way and not, for example, 
as <a> in massage, etc., spelling /a:/ and the <e> only forming a digraph 
with <g>. 

It is noticeable that for <a.e, e.e, i.e> these extra correspondences 
derive from French spelling conventions. As Crystal (2012) and especially 
Upward and Davidson (2011) document, French words which entered the 
English language before about 1600 became anglicised in pronunciation 
and followed English spelling conventions, such as they were (there had 
been some inconsistencies in Anglo-Saxon (Old English) spelling - in 
particular it had no consistent system for distinguishing long and short 
vowels - and after 1066 Norman French scribes introduced others). But 
what | have in several places called ‘more recent’ borrowings of French 
words (that is, those which arrived after the completion of the Great Vowel 
Shift in about 1600) in almost all cases retained their French spellings 
and (approximations to) their French pronunciations, despite the fact that 
this has introduced new correspondences for some vowel graphemes and 
increased the number of inconsistencies. 

One of the very few words introduced after 1600 which did acquire an 
anglicised pronunciation is blouse: if it had behaved like other ‘more recent’ 
borrowings it would be pronounced closer to French as /blu:z/ rather than 
as anglicised /blauz/. The process of assimilation can be heard at work in 
garage, mauve, doyen(ne) and foyer: 

The General American pronunciation of garage as /ga'ra:3/ is closest 
to French, having only replaced /a/ with /9/ in the first syllable and 
// with /r/ in the second. In RP, there are two pronunciations. In 
/‘gzera:z/, the last two French phonemes have been retained, but the 
stress has shifted to the first syllable and the vowel in that syllable 
has shifted to /#/. In /‘gzr1d3/, anglicisation is complete: the second 
syllable is now pronounced as in the great majority of polysyllabic 
words ending <-age>, and the only phoneme which is still pronounced 
as in French is the initial /g/; 

In Britain, the pronunciation of mauve varies between French- 
like /mauv/, with the English diphthong /au/ replacing /o/, and 
/moiv/, with a fully anglicised vowel perhaps reflecting a ‘spelling 
pronunciation’ of <au>; the latter pronunciation is less usual; 

Both doyen and doyenne have a French-like pronunciation /dwat'jen/ 
and a mid-way pronunciation /do1'jen/ where the French 2-phoneme 
sequence /wat/ has shifted to /31/ but the stress has remained on the 
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second syllable; doyen (but not doyenne) also has the fully anglicised 
pronunciation /'do1jan/ where the stress has moved to the first syllable; 
Similarly, foyer has three pronunciations: /‘fwatjer/ with the French 
pure vowels /a, e/ anglicised to diphthongs /ai, e1/ and the stress 
shifted to the first syllable, /'fotjez/ (which | first noticed in an RP 
speaker in November 2013) with the first diphthong further anglicised 
to /a1/, and /'foija/ with the final vowel totally anglicised to schwa. 
In large numbers of cases where the consonant in the dot position is <c, 
g>, and a small number where it is <v>, the <e> also forms a digraph 
with the consonant letter (for dual-functioning see section 7.1) spelling 
(and pronounced) /s, dg (or /3/), v/ respectively. Traditionally, the <e> is 
said to ‘mark’ <c, g> as having their ‘soft’ pronunciations /s, d3/ (see also 
section A.8), and not their ‘hard’ pronunciations /k, g/. The alternative ‘soft’ 
pronunciation of <g> as /3/ never seems to be noticed, but this is the 
least frequent phoneme in spoken English and its spellings are rarely taught 
explicitly. <ve> is different: the <v> would spell and be pronounced /v/ 
without the <e>, which is present purely as a strong spelling convention 
(see section 3.7.7) even when the <e> is not part of a split digraph. 

When is a split digraph not a split digraph? Note the restriction to letter- 
names and alternative long vowels or diphthongs in my definition. There 
are also copious examples of words with final <e> preceded by a single 
consonant letter (other than <r, w, x, y>) preceded by <a, e, i, 0, u> not 
preceded by another vowel letter where <a, e, i, 0, u> have neither their 
letter-name pronunciations nor the alternative pronunciations listed above. 
For categories and lists see the exceptions mentioned under <a.e, e.e, i.e, 
0.e, u.e> in sections 10.4/17/24/28/38. (There are no such cases with 
<y.e>). 

In general, these are words where what appear to be the ‘leading’ vowel 
letters in split digraphs are pronounced ‘short’. In most cases the ‘leading’ 
vowel letter is pronounced /a/ or /1/, for example in mortgage (and lots of 
other words ending in unstressed <-age>), purchase, accurate (and lots of 
other words ending in <-ate>), college, diocese, bodice (and several other 
words ending in <-ice>), engine (and several other words ending in <-ine>), 
mortise, practise, premise, promise, treatise, definite (and several other 
words ending in <-ite>), give, massive (and thousands of other polysyllabic 
nouns and adjectives ending in <-ive>), fulsome, handsome (and all the 
other adjectives ending in <-some>), welcome, purpose, lettuce, minute 
(/‘minit/, ‘60 seconds’). Examples with other short vowel phonemes are 
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axe, have (when stressed) with /x#/, allege, annexe, clientele, cortege with 
<e>, compote, (be)gone, scone, shone with /v/, and above, become, come, 
deluxe, done, dove, glove, love, none, shove, some with /a/. It seems to me 
that all such cases, unlike those involving long vowels, diphthongs or /ju:/, 
are more economically analysed as having the relevant short vowel spelt 
variously <a, e, i, 0, u> and the word-final <e> forming a digraph with 
the intervening consonant letter. All of this admittedly produces a fuzzy 
boundary around split digraphs, and a possible source of confusion over 
some words which might have a split digraph pronunciation, or not, but it 
seems to me that complete consistency is not attainable in this area either. 

For an attempt at a pedagogical statement of the split digraph rule see 
section 11.4. 


A.7 Rhymes and phonograms (and rimes) 


To keep this section simple to start with, let us define rhymes as those 
endings of one-syllable words which sound the same in more than one 
word, e.g. the /u:t/ sounds of boot, coot, hoot, toot, etc. Phonograms are the 
corresponding parts of written words, in this case <oot>; alternative terms 
for ‘phonogram’ are ‘rime’ (in that spelling) and ‘(word) body’. | do not use 
‘body’ because it has more usual meanings in the language, and do not use 
‘rime’ in this section because it is confusing to use homophonic terms for 
the corresponding parts of spoken and written words (and because there is 
already a word ‘rime’ meaning ‘hoar frost’). 

There are claims (apparently first made by Adams, 1990: 85, but first 
systematically investigated by Treiman et al., 1995) in the literature on 
teaching children to read and spell that many of the alternative spellings 
of vowel phonemes and the alternative pronunciations of vowel graphemes, 
considered to be unpredictable in isolation, are more predictable if 
phonograms and rhymes are considered as units. The claim was originally 
confined to monosyllables with CVC phonological structure, which led to 
monosyllables with no initial consonant, and therefore VC structure, and 
relevant polysyllables, being mostly overlooked. Although my analysis is 
focused almost entirely at the segmental (phoneme and grapheme) level, 
| have examined this claim (in its original version plus VCs and some 
polysyllables), and found it largely unconvincing. 

In the spelling direction, it seems to me that there are no rhyme- 
phonogram correspondences in (C)VC monosyllables which would be worth 
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teaching as units because all the ‘families’ of words are too small and/or 
have too many exceptions. However, in (C)VCC monosyllables there are just 
two that might be worth teaching: 


/etnt/ spelt <-aint> in faint, paint, plaint, quaint, saint, taint 
(only exceptions: ain’t (sort of), feint). Despite applying to only 6 
monosyllables (against 2 exceptions), this is probably worth teaching 
because it generalises to the final syllables of 6 polysyllables: acquaint, 
attaint, complaint, con/di/re-straint and (despite possibly being split 
across syllable boundaries) to non-final syllables in at least 3 more: 
maintain, plaintiff, plaintive. Score: 6-2 in monosyllables, 15-2 
overall. 

/auld/ spelt <-old> in bold, cold, fold, gold, hold, old, scold, sold, 
told, wold, plus the final syllables in behold, cuckold, blind/mani-fold, 
marigold, scaffold, threshold. The only stem word exception is mould 
(and this is spelt mold in the US), but there is possible confusion with 
several past tenses/participles, some of which are homophones of 
the stem words: bowled, doled, foaled, holed, poled, polled, rolled, 
soled, strolled, tolled. Taking in non-final syllables of polysyllables 
appears to add just one example, solder, and one exception, shoulder 
(soldier does not qualify as either because here <old> spells /auldz/). 
Score: in monosyllables, 10-10 in UK, 11-9 in US; overall, 18-11 in 
UK, 19-10 in US. Despite the poor score in monosyllables, this would 
probably be worth teaching when children are clear about spelling 
regular past tenses and participles with <-ed>. 


For a clear example of a phonogram whose spelling is entirely predictable 
at the segmental level, and therefore not worth teaching as a unit, see the 
discussion of /i:z/ under /z/, section 3.7.8. 

Conversely, for the final syllable /zam/ which is always unstressed (and 
is therefore not a rhyme) and almost always spelt <-sm> (only exception: 
bosom), see section 3.5.4. To the latter might be added words ending /aus/ 
(see section 3.7.6). There are only four monosyllables with this rhyme: close 
(adjective/noun), dose, gross and the very rare surname Groce, but dozens 
of polysyllables (the ‘sugar’ words dextrose, glucose, lactose, sucrose (all 
of which have alternative pronunciations in /auz/), and the adjectives 
comatose, lachrymose, morose, verbose, etc.), the only exception without 
<-ose> being the verb engross. So ‘Word-final /aus/ is almost always spelt 
<-ose>’ is a reliable generalisation - but probably of very limited use to 
young children and their teachers. 
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In the reading aloud direction, it seems to me that there are just five 
phonogram-rhyme correspondences which would be worth teaching as 
units, three applicable mainly to (phonologically) (C)VC monosyllables and 
two to (C)VCC monosyllables: 


<-all> pronounced /9):1/ in all, ball, call, fall, gall, hall, pall, small, 
squall, stall, tall, thrall, wall. The only words which are exceptions in 
RP are mall, shall, with /zl/ - and mallis /mo:|/ in General American. 
These final syllables would have to be clearly distinguished from 
non-final syllables with <all>, e.g. alliance. Score: 13-2 in RP, 14-1 
in General American. 


<-ead> pronounced /ed/ in bread, dead, dread, head, lead (the 
metal), read (past tense and participle), spread, stead, thread, tread 
(exceptions: bead, knead, lead (verb), mead, plead, read (present 
tense), with /i:d/). This pattern generalises to breadth if we are 
feeling generous, and to the final syllables of two polysyllables: 
ahead, instead. \f we are feeling even more generous we can add 
some polysyllables with non-final <ead>, of which some but not all 
are derived forms of the relevant monosyllables: already, meadow, 
Reading, ready, steadfast, steady, treadle (contrast reading, beadle). 
Score: 11-6 in monosyllables, 20-8 overall. 


<-ind> pronounced /aind/ in bind, blind, find, grind, hind, kind, mind, 
rind, wind (‘turn’), plus one stem polysyllable, behind (exceptions: 
rescind, tamarind, wind (‘stiff breeze’) with /1nd/). These final 
syllables would have to be clearly distinguished from non-final 
syllables with <ind>, e.g. indicate. Score: 9-1 in monosyllables, 10-3 
overall. 


<-old> pronounced /auld/ in bold, cold, fold, gold, hold, old, scold, 
sold, told, wold, plus mold in US spelling and the polysyllables 
behold, cuckold, blind/mani-fold, marigold, scaffold, threshold and 
(non-finally) solder (only exception: soldier with /auldg/). Score: in 
monosyllables, 10-0 in RP, 11-0 in General American; overall, 18-1 
in RP, 19-1 in General American. 


<-ook> pronounced /uk/ in book, brook, cook, crook, hook, look, nook, 
rook, shook, took plus polysyllables Chinook, forsook (exceptions: 
gook, snook, spook, stook, gobbledegook, all with /u:k/). Score: 10-4 
in monosyllables, 12-5 overall. 
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This fairly meagre haul of reasonably reliable phonogram-rhyme 
correspondences would be worth adding to the similarly meagre haul of 
four reliable pronunciation rules for vowel graphemes analysed in chapter 
11, but even together they fail to dent the importance of focusing almost all 
phonics teaching for reading on the segmental level. And phonics teaching 
for spelling should be even less influenced by the two possibly usable 
rhyme-phonogram correspondences listed above. For supporting evidence, 
see Solity and Vousden (2008: 490), who found that teaching onset-rimes 
would mean ‘there would have to be a fourfold increase in the amount of 
information children would need to learn to read material aimed at children, 
and an eightfold increase to move on to adult-directed text’. 


A.8 Dual-functioning 


In section 7.1 | deliberately sidestep an obvious question: why analyse any 
letters at all as belonging to more than one grapheme at the same time? 

(Carney (1994: 37) was strongly opposed to ‘overlapping’ of graphemes, 
but did not analyse cases such as those | adduce here. However, he was 
rightly critical of Cordts (1965), who ‘for reasons not explained’ assigned 
some letters to two graphemes where this is not warranted, e.g. the <e> 
in cake to both <a.e>, which is essential, and to <ke> as a Spelling of /k/, 
which is entirely unnecessary). 

Well, if you do not assign some letters to more than one grapheme at the 
same time, the correspondences for some phonemes become even more 
complicated than they already are, and the results seem to me counter- 
intuitive. For example, it is clear that in care there are two graphemes <c, 
are> spelling the two phonemes /k, ea/. But how should derived forms 
such as caring be analysed? There are now five phonemes /k, ea, r, 1, 9/ of 
which /k, 1, n/ are obviously spelt <c, i, ng>. The /r/ is also obviously spelt 
<r> - but does this mean that /ea/ is now spelt only <a>? If so, should this 
analysis be extended to independently-occurring medial examples such 
as parent? Here there are six phonemes /p, ea, r, 3, n, t/, of which four 
/p, 2, n, t/ are obviously spelt <p, e, n, t>, leaving <a, r> to spell /ea, r/. 
Both here and in caring we could analyse the <a> as spelling /ea/ and <r> 
as spelling /r/ and add /ea/ spelt <a> to the list of correspondences - but 
then in scarce, scarcity /ea/ can only be analysed as spelt <ar>, so /ea/ 
spelt <ar> has to stay in the inventory. 
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The problems just with /ea/ ramify when we look at words like air, 
aeroplane, pair, pairing, mayor, mayoress, sombrero, scherzo, heir, heiress: 
/ea/ spelt <air, ayor, er, heir> must be in the inventory to account for air, 
pair, mayor, scherzo, heir, but if we rule out dual-functioning we have to 
add /ea/ spelt <ae, ai, ayo, e, hei> to the list of correspondences (and delete 
only /ea/ spelt <aer>), and also add <ayo, hei> to the list of graphemes 
(and delete only <aer>) to account for aeroplane, pairing, mayoress, 
sombrero, heiress. Similar considerations apply to other phonemes spelt 
with graphemes ending in <e, r, w, y> where | posit dual-functioning. | 
therefore conclude that my analysis is actually conceptually neater, and 
keeps the lists of graphemes and correspondences from growing even more 
enormous. 

Also, dual-functioning is an economical factor in English spelling in 
another sense - without it we would always have to spell various adjacent 
phonemes separately, and in many cases the system would have no well- 
motivated way of doing this. For example, bolero, bowie, buoyant, hero, 
jury, parent might have to be spelt “balairro, “boewie, “boiyant, “heerro, 
“joorry, ‘pairrant. And many other single-function spellings would probably 
be even stranger and more complicated. 

My dual-functioning analysis solves a problem to which Venezky (1970: 
53; in the following quotations | have edited Venezky’s symbols into those 
used here) says ‘no realistic solution is possible’ without adopting his 
proposal for a set of graphemes he designates as ‘markers’. These are, 
for example, the letter <e> in clothe and pace. He correctly notes that ‘in 
each word [the <e>] marks two separate patterns. In clothe [it] marks the 
correspondences <o> —/au/ and <th> — /6/; in pace it marks <a> — /e1/ 
and <c> — /s/,’ and goes on: ‘The traditionalist is faced with a dilemma 
here; are the units <o.e/a.e> or <the/ce>? Or shall we take a fine razor 
and split <e> into two parts so that both alternatives can be taken?’ This 
attempted reductio ad absurdum is proved meaningless once we accept 
that a letter can belong to two graphemes simultaneously. 


A.9 Graphemes containing apostrophes 


There are four of these in my analysis: <ey’re> spelling /ea/ in they’re, 
<e’er> spelling /ea/ as a contracted form of ever either independently or in 
(e.g.) ne’er, where’er, <e’re> spelling /1a/ in we’re, and <ou’re> spelling 
/>:/ in you’re. Although these are all contracted forms and not stem words 
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| have included them because their pronunciations are distinctive, and 
unpredictable from the uncontracted forms. 

Along the way | considered two other possible graphemes containing 
apostrophes for inclusion: <n’> spelling /an/ as in isn’t, etc., and <’s> as 
the regular singular and irregular plural possessive and the contraction of 
is/has. | decided against <n’> because it seemed neater to consider /an/ in 
these contractions as spelt solely by the <n> (see section 3.5.5). 

<’s> was in drafts of the book for a very long time, with three 
correspondences: /s/ after voiceless non-sibilant consonants, /z/ after 
vowels and voiced non-sibilant consonants, and /1z/ after sibilant 
consonants. It was the last of these that originally led me to include <’s>, 
on the grounds that it seemed a neat way of accounting for the 2-phoneme 
sequence /1z/ in this context; including this correspondence logically 
meant bringing in the other two. But | eventually took <’s> out on much the 
same grounds as for excluding <n’> - it seemed neater to consider /1z/ as 
spelt solely by the <s> (see section 3.7.8). Omitting this correspondence 
logically implied omitting the other two. 


A.10 Word stress 


In the sections of chapter 10 devoted to the vowel letters <a, e, i, 0, u> 
as single-letter graphemes | included /a/ as one of their pronunciations, 
accompanied by the comment ‘regular in unstressed syllables’ or ‘regular 
when unstressed’, and the words ‘stressed’ and ‘unstressed’ occur in many 
other places in that chapter and this Appendix. In doing so, | evaded what 
might be considered a key issue in deriving the pronunciations of English 
words from their written forms: how can you tell which syllables are stressed 
and which are not? 

The short answer is that this is a hugely complicated topic which could 
not fit in this book and would require a large, separate volume. 

Some would say that that volume already exists, in the shape of 
Chomsky and Halle’s (1968) famous study The Sound Pattern of English. 
But, technically, their rules for assigning stress, including and especially 
their Main Stress Rule, operate not on the written forms of English words 
but on abstract or ‘deep’ versions of them which, it is essential for present 
purposes to note, are phonological. Thus the entire analysis is strictly 
speaking irrelevant to the question of how to predict stress from written 
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forms. Moreover, Chomsky and Halle’s system is so complex that it is much 
too unwieldy for pedagogical purposes. 

As far as I’ve been able to discover, there are just two authors who have 
tackled the question of how to predict word stress in English from written 
forms. Wijk (1966: 124) says: 


Though it is not possible to lay down any completely satisfactory 
rules for the stressing of English words, it should be emphasised that 
there are vast numbers of words which do not offer any difficulty at 

all in this respect. 
and then proceeds (pp.125ff.) to present five principal categories of 
polysyllables for which he says it is possible to formulate a rule - but most 
do not seem well worked out, all except those | have adopted and numbered 
(1) and (2) below have copious exceptions, and all the rest tacitly assume 
that it is obvious how many syllables written English words have. 

Dickerson (1978) cites several authors who have said, in effect, that it 
is impossible to predict the stress patterns of English words - but they 
were all referring to trying to predict the stress pattern from the spoken 
form, that is, from the sequence of full and reduced (schwa) vowels. (Rule 
45 in Clymer, 1963/1996 ‘When the last syllable is the sound 7 [assuming 
this means a General American retroflex (‘r-coloured’) version of /a/] ‘it 
is unaccented [=unstressed]’ appears to be a confused statement of the 
obvious fact that in English /a/ is almost never stressed, but is wrong to 
imply that this is true only in word-final position.) But if you already know 
the sequence of full and reduced vowels in English words you already know 
the stress pattern, and if you already know the stress pattern you already 
know the sequence of full and reduced vowels, so the argument is circular. 
Native speakers of English usually already know both, and do not need to 
be able to deduce the stress pattern from the written form, except perhaps 
when we encounter an unfamiliar word - and then we may well be in the 
same boat as foreign learners. 

The serious implication here is that it can be difficult to work out from 
the spelling the pronunciation of a word you have never heard people say, 
especially if it is unusual, and easy to get some words wrong. For example, 
| have heard (or heard of) people pronouncing cotoneaster as /'kotani:sta/ 
(‘cotton-easter’) rather than /kea'tauni:tjesta/, machete as /ma'tfixt/ 
(‘muh-cheat’) rather than /ma'feti:/, oesophagus as /avu'witsaufegas/ 
(‘oh-wee-so-fag-us’) rather than /i:'sofagas/, Yosemite as /‘javzamatt/ 
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(‘yoh-zuh-might’) rather than /jo'semiti:/, and (possibly the most classic 
case) misled as /*mizald/ (like the verb mizzled). 

There are two words spelt forestage: the obvious two-syllable one 
pronounced /'fo:ste1d3/, ‘the part of a theatrical performance area between 
the curtain and the orchestra pit’, and with a morpheme boundary after 
fore-; and a three-syllable word from medieval English law with a morpheme 
boundary after forest-, pronounced /'foristid3/ and meaning either ‘a duty 
payable to the king’s foresters’ or ‘a service paid by foresters to the king’. 
Similarly, forestride, with a morpheme boundary after forest- and therefore 
more often spelt as two words, pronounced /'foristratd/ with three syllables, 
was briefly the name of a bus service in the Reading area; but it could be 
misread (and | did so misread it) as having a morpheme boundary after fore- 
and the two-syllable pronunciation /'fo:stratd/, and as perhaps meaning a 
specially determined way of walking (but there is no trace of such a word in 
the dictionaries). All of this again illustrates the need, which | stated right 
at the start of this book, to look up the pronunciation of whole words ina 
good pronouncing dictionary (that is, one which uses IPA symbols). 

Dickerson took what appeared to be a novel approach to the problem 
of deducing how English polysyllables are stressed from their written form. 
Unfortunately, | found it impossible to adopt. This is because he assumes 
that non-native learners of English know what a syllable is: 


To assign major stress to a word, only two syllables are relevant. One 
is called the Key Syllable, the other the Left Syllable, namely, the 
syllable immediately to the left of the Key. 


(Dickerson, 1978: 138) 

This also assumes that learners know where syllables in written English 
words begin and end and (as with Wijk) how many syllables there are in 
written English words. These are huge assumptions and, as | found when 
| tried to work them through, very awkward to specify in detail, mainly 
because they covertly assume that the learner already knows the spoken 
form of the word - which is precisely what Dickerson says he is trying not 
to assume. 

For example, it is true that the great majority of two-syllable words 
in English are stressed on the first syllable (as pointed out by Clymer, 
1963/1996, rule 30) - but how is a reader who does not already know (for 
example) the words blase, dais to deduce from their written forms that they 
have two syllables when all other words ending <-ase> or containing <ai> 
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between two consonant letters are monosyllables? Even if that prediction 
were possible, trying to make a usable rule for stress on two-syllable words 
would still entail listing hundreds of such words which are instead stressed 
on the second syllable, including dozens of cases where verbs are stressed 
on the second syllable and identically-spelt nouns/adjectives are stressed 
on the first. Indeed, Hunnicutt (1976) showed that a computerised version 
of the Chomsky-Halle rules could not assign the correct stress to such pairs. 

Another serious problem for any attempt to deduce the stress pattern of 
English words from their written forms arises from the existence of the elided 
vowels analysed in section 6.10 - it is hardly ever possible to deduce that a 
particular vowel letter in medial position (as opposed to stem-final ‘silent’ 
<e>) represents no phoneme at all and therefore isn’t even a candidate for 
taking the stress. Consider, for example, afferent (three syllables, first <e> 
pronounced /a/) and different (two syllables, first <e> elided). And in some 
words a particular vowel may be elided or not, often depending on accent, 
for example migratory, pronounced either as /'‘maigratri:/ (three syllables, 
stress on first, <o> elided) or as /mar'grettari:/ (four syllables, stress on 
second, no elision) or as /maigra'torri:/ (four syllables, stress on third, no 
elision). 

So here | have merely stated a few useful rules for determining which 
syllable in polysyllables has main stress. The three most useful rules for 
predicting main stress are: 

1) Virtually all words ending in <ea, eo, eou, eu, ia, io, iou, iu> followed 
by a single consonant letter or none and with at least one vowel letter 
earlier in the word have the stress on the syllable preceding <ea, eo, 
eou, eu, ia, io, iou, iu>, including all the hundreds of words ending 
<-tion> (as mentioned in rule 36 of Clymer, 1963/1996) and all those 
containing the five medial graphemes (other than <sh>) pronounced /J/ 

2) Virtually all words ending in <-ience, -iency, -ient, -(s)sive>, have the 
stress on the preceding syllable 

3) Virtually all words ending in <-ic(al)> have the stress on the preceding 
syllable. For exceptions see the last four paragraphs of section 10.22. 

Otherwise, there are only rules covering small numbers of cases, such as: 

4) Almost all words ending in <-ator> have the stress on the <a> if they 
have three syllables, e.g. credtor, curdtor, dictdtor, spectdtor, otherwise 
on the syllable two before the <a>, e.g. administrator, agitator, aviator, 
calculator, commentator, insulator. Only exceptions: consérvator, 
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conspirator, orator, prédator, sénator, which all have <a> pronounced 
/a/ and stress on the syllable before that 

5) The grapheme <air> is always stressed in polysyllables except in corsair 
(usually stressed on first syllable), millionairess (where the feminine 
ending <-ess> is usually stressed instead), mohair (always stressed on 
first syllable) 

6) All words ending in <-eer, -esce, -esque, -ique> have stress on the 
final syllable 

7) The grapheme <ier> is always stressed in polysyllables, except that 

frontier can be stressed on either syllable (but there are lots of words 
where <i, er> are separate graphemes) 

8) All words ending in <-tte> have stress on the final syllable except 
etiquette, omelette, palette, which have stress on the first syllable 

9) Almost all words ending in <-oon> have stress on the final syllable 
except forenoon, honeymoon, pantaloon, which have stress on the first 
syllable, and those ending in <-zoon> (‘living thing’), in which the 
ending has two syllables and is stressed on the first <o>. 

There is also one helpful rule for where main stress does not fall: 

10) The vowel letters as single-letter word-final graphemes in polysyllables 

are hardly ever stressed (Clymer, 1963/1996, rule 32 is a subset of 
this applying only to two-syllable words ending in a consonant letter 
followed by <y>). Strictly speaking this does not apply to word-final 
<e> as part of a split digraph, but that is of course never stressed either 
since there is no word-final vowel phoneme in such cases. There are 
very few exceptions, all of which are disyllables (and none at all with 
<i, u>): mama, papa; blase (which can also be stressed on the first 
syllable), manque, outre, risque; lasso, ally (if stressed on the second 
syllable, as the verb sometimes is), ap/com/im/re/sup-ply, defy, deny, 
descry, espy, July, rely. This rule implies that: 
in all other two-syllable words with a single vowel letter as the word- 
final grapheme (that is, those with only one other vowel grapheme 
earlier in the word), the stress falls on that other vowel grapheme (= 
the first syllable) 
a single-letter word-final vowel grapheme is never stressed in words 
of more than two syllables (except perhaps in unassimilated loanwords, 
e.g. Italian omerta), but this is of no help in predicting where the 
stress does fall in those longer words. 
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Similarly, Clymer’s (1963/1996) rule 35 ‘when ture is the final syllable in a 
word, it is unaccented’ (more accurately, ‘when <-ture> is word-final, it is 
unstressed’) is true, helpful if the word has only two syllables (e.g. picture), 
but useless for determining where the stress falls in longer words (e.g. 
furniture). And Clymer’s rule 31 ‘If a, in, re, ex, de, or be is the first syllable 
in a word, it is usually unaccented [=unstressed]’ poses huge problems. If 
it is meant to refer to prefixes, there is no way for anyone without deep 
etymological knowledge to tell when these word-beginnings are prefixes 
and when they are not. But if it is meant to refer to all words with these 
beginnings then it would be necessary to specify that each of them (except 
<ex-, in->) has to be followed by at least one consonant letter and all of 
them then by at least one vowel letter that is not ‘magic <e>’; and even then 
a quick scan of a dictionary reveals that there are far too many exceptions 
for the rule to be useful. 

So, beyond the few definitely or possibly useful rules given above, the 
task of predicting word stress from the written forms of English words also 
awaits another study. That study would have to avoid the assumption, which 
| have knowingly perpetrated/perpetuated in section 10.42 and in the ‘rules’ 
| have stated above, that readers can tell from the spelling of English words 
how many syllables their spoken forms contain. 


Appendix B: Pedagogically selected 
lists of phoneme-grapheme 
and grapheme-phoneme 
correspondences 


These lists are intended to be much more useful to teachers and to writers 
of early reading books than the full lists of correspondences in chapter 8. 
Similar tables (also largely devised by me) appeared in the Notes of Guidance 
to Letters and Sounds (DfES, 2007). As far as possible | have ensured that all 
words within the 1000 most frequent words in English whose correspond- 
ences are not covered by the major correspondences are listed in the 
right-hand columns of Tables B1-2 and in Tables B.6 and B.8. My source 
for the 1000 most frequent words was: http://en.wiktionary.org/w/index. 
php?title=Category: 1000_English_basic_words&pagefrom=stamp#mw- 
pages [last accessed 20/8/2012]. 

For guidance on the phonetics underpinning the application of these lists in 
phonics teaching see Burton (2011), which also contains versions of these lists. 


TABLE B.1: THE PHONEME-GRAPHEME CORRESPONDENCES OF BRITISH ENGLISH 
SPELLING, BY RP PHONEME, 1: CONSONANTS. 


Grapheme(s) . Common words with rare 
Phoneme As in... 

Basic Other graphemes for the phoneme 
/b/ b bb bed rabbit <bu> build buy 
/k/ c ckkq come back look queen | <cu> biscuit 

ch Christmas 
/tf/ ch tch children match <t> nature picture <ti> 
question 

/d/ d dd ed dad teddy called 


© 2015 Greg Brooks, CC BY http://dx.doi.org/10.11647/OBP.0053.13 
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TABLE B.1: THE PHONEME-GRAPHEME CORRESPONDENCES OF BRITISH ENGLISH 
SPELLING, BY RP PHONEME, 1: CONSONANTS, CONT. 


Grapheme(s) . Common words with rare 
Phoneme As in... 
Basic Other graphemes for the phoneme 
/f/ f ff ph from off elephant <ft> often soften <gh> cough 
enough laugh rough tough 
/g/ g gg get jogging <gh> ghost <gu> guess guy 
/h/ h horse <wh> who whole whose 
/d3/ j dg dge | just budgie bridge 
g ge giant orange 
/m/ m mm my mummy <mb> climb lamb thumb 
<me> come some 
<mn> autumn column 
/n/ n nn now dinner <gn> gnome sign 
<kn> knife knock knot know 
<ne> done engine none 
/9/ ng n sing sink <ngue> tongue 
/p/ p pp pen apple <ph> shepherd 
/r/ r rr red berry <rh> rgyme rhythm 
<wr> write wrong 
/s/ s ccese | sit city once horse <sc> science scissors <st> 
ss grass castle Christmas listen whistle 
/S/ sh ti ship station <ch> machine <ci> special 
<s> sugar sure 
<ss> issue pressure tissue 
<ssi> permission 
/3/ si vision <s> measure pleasure 
treasure usual 
/t/ t tt ed but little looked <pt> receipt <th> Thomas 
<tw> two 
/8/ th thing 
/d/ th that <the> breathe 
Iv/ Vv ve very have <bv> obvious <f> of 
/w/ w u went queen <wh> what when (etc.) wheel 
whistle white 
/wa/ spelt <o> once one 
/j/ y yellow <i> onion view 
/z/ z sseze | zoois please sneeze <si> business <ss> scissors 
Zz puzzle 
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TABLE B.2: THE PHONEME-GRAPHEME CORRESPONDENCES OF BRITISH ENGLISH 
SPELLING, BY RP PHONEME, 2: VOWELS. 


Phoneme Grapheme(s) Rea Common words with rare 
sin... 
Basic Other graphemes for the phoneme 
/ze/ a and 
/2/ a eero a the butter <ar> sugar <i> possible <our> 
button colour favour honour <re> 
centre <ure> nature picture 
/et/ a.e aai ay came bacon paint | <aigh> straight <ea> break 
day great steak <eigh> eight <ey> 
they 
/ea/ air are ar fair fare parent <ear> bear pear tear wear 
<ere> there where <eir> their 
Ja:/ ar a far ask <al> half <are> are <au> aunt 
laugh <ear> heart 
/e/ e ea went bread <a> any many <ai> again(st) 
said <ay> says <ie> friend 
/ix/ ee eeaeyiey | see he beach key | <e.e> these <eo> people <i.e> 
field city police 
/1a/ eer ear er ere cheer hear hero <ier> fierce 
here 
/31/ er ir or ur her girl word fur <ear> early earth heard learn 
<ere> were <our> journey 
/1/ i ey is England gym <a> language sausage <o> 
women <u> business minute 
/at/ ie iighy like | night my <ei> either <eigh> height 
<eye> eye <ye> goodbye 
/ate/ spelt <ir, ire, yre> biro 
fire wire tyre 
/o/ fe) a not was <au> because sausage <ho> 
honest honour <ou> cough 
/3u/ fe) 0.e OW so bone blow <oa> approach boat <oh> oh 
<ough> although 
/o1/ oi oy boil boy 
/v/ oo u book put <o> woman <oul> could 
should would 
/ux/ oo ew u u.e too blew super <o> do to two who <oe> shoe 
rule <o.e> lose move prove whose 
<ou> you <ough> through 
<ue> blue true <ui> fruit 
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TABLE B.2: THE PHONEME-GRAPHEME CORRESPONDENCES OF BRITISH ENGLISH 
SPELLING, BY RP PHONEME, 2: VOWELS, CONT. 


Phoneme 


Grapheme(s) 


Basic 


Other 


Asin... 


Common words with rare 
graphemes for the phoneme 


oor 


ure 


poor sure 


<our> tour 


or 


aar au aw 
ore 


for all warn sauce 
saw before 


<augh> caught naughty 

<oar> board <oor> door floor 
<ough> bought brought fought 
ought thought <our> course 
four your 


/au/ ou 


Ow 


out down 


/aue/ spelt <hour> hour /auea/ 
spelt <our, ower> flour flower 


but some 


<oo> blood flood <ou> 
country couple double 
encourage enough rough tough 
trouble young /wa/ spelt <o> 
once one 


TABLE B.3: THE PHONEME-GRAPHEME CORRESPONDENCES OF BRITISH ENGLISH 
SPELLING, 3: 2-PHONEME SEQUENCES FREQUENTLY SPELT WITH SINGLE GRAPHEMES. 


2-phoneme Grapheme(s) aed 2-grapheme spellings for 
sin... 

sequence Basic Other same sequence 
/al/ (only le little animal label pencil carol 
word-final) beautiful 
/jux/ u eau ew | union beauty few | view you 

ue u.e argue cute 

/ks/ x box banks tricks politics 


N.B. The 2-phoneme sequence /kw/ is almost always spelt <qu> and should 
also be taught as a unit. 
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TABLE B.4: THE GRAPHEME-PHONEME CORRESPONDENCES OF BRITISH ENGLISH 
SPELLING, 1: SINGLE GRAPHEMES FREQUENTLY PRONOUNCED AS 
2-PHONEME SEQUENCES. 


2-phoneme Other . 
Grapheme(s) Asin... 
sequence phonemes 
eau eW U Ue Ue /jux/ (too many to | beauty few union argue cute 
le (only word-final) /al/ list) little 
x /ks/ box 


N.B. The 2-grapheme sequence <qu> is almost always pronounced /kw/ and 
should be taught as a unit. 


TABLE B.5: THE GRAPHEME-PHONEME CORRESPONDENCES OF BRITISH ENGLISH 
SPELLING, 2: MAJOR CORRESPONDENCES FOR CONSONANT GRAPHEMES. 


Phoneme(s) 
Grapheme(s) Asin... 
Basic Other 

b bb /b/ bed rabbit 
Cc /k/ /s/ come city 
ce /s/ once 
ch /tf/ /k/ children Christmas 
ck /k/ back 
d dd /d/ dad teddy 
dg(e) /d3/ budgie bridge 
ed /d/ /t/ called looked 
f ff /f/ from off 
g /g/ /d3/ get giant 
ge /d3/ orange 
gg /g/ jogging 
h /h/ horse 
j /d3/ just 
k /k/ look 
il /I/ leg ball 
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TABLE B.5: THE GRAPHEME-PHONEME CORRESPONDENCES OF BRITISH ENGLISH 
SPELLING, 2: MAJOR CORRESPONDENCES FOR CONSONANT GRAPHEMES, CONT. 


Phoneme(s) 
Grapheme(s) Asin... 
Basic Other 

n /n/ /49/ now sink 
ng /n/ sing 
nn /n/ dinner 
p pp /p/ pen apple 
ph /f/ elephant 
q /k/ queen 
rer /r/ red berry 
sse /s/ /z/ sit is horse please 
sh /S/ ship 
si /3/ vision 
ss /s/ grass 
ttt /t/ but little 
tch /tf/ match 
th /8/ /d/ thing that 
ti /f/ /tf/ station question 
u /w/ queen 
vve Iv/ very have 
w /w/ went 
y /j/ yellow 
Z Ze ZZ /z/ Zoo sneeze puzzle 
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TABLE B.6: THE GRAPHEME-PHONEME CORRESPONDENCES OF BRITISH ENGLISH 
SPELLING, 3: MINOR CORRESPONDENCES FOR CONSONANT GRAPHEMES. 
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Grapheme(s) Phoneme(s) Asin... 
bu /b/ build buy 
bv /v/ obvious 
ch ci /f/ machine special 
cu /k/ biscuit 
f /v/ of 
ft /f/ often soften 
gh /fg/ cough enough laugh rough tough; ghost 
gn /n/ gnome sign 
gu /g/ guess guy 
i /j/ onion view 
kn /n/ knife knock knot know 
mb me mn /m/ climb lamb thumb; come some; autumn column 
ne /n/ done engine none 
ngue /y/ tongue 
fo) /wa/ once one 
ph /p/ shepherd 
pt /t/ receipt 
rh /r/ rhyme rhythm 
s ssi /f/ sugar sure; permission 
s /3/ measure pleasure treasure usual 
sc /s/ science scissors 
si /z/ business 
ss /fz/ issue pressure tissue; scissors 
st /s/ castle Christmas listen whistle 
t /tf/ nature picture 
th tw /t/ Thomas two 
the /d/ breathe 
wh /hw/ who whole whose; what when (etc.) wheel whistle 
white 
wr /r/ write wrong 
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TABLE B.7: THE GRAPHEME-PHONEME CORRESPONDENCES OF BRITISH ENGLISH 
SPELLING, 4: MAJOR CORRESPONDENCES FOR VOWEL GRAPHEMES. 


Phoneme(s) 
Grapheme(s) Asin... 
Basic Other 
a /ze/ /ela:po:a/ and bacon ask was all about 
a.e ai ay /e1/ came paint day 
air are /ea/ fair fare 
ar /ax/ /ea 21/ far parent warn 
au aw /o1/ sauce Saw 
e /e/ /ixta/ went he England the 
ea /ix/ /e/ beach bread 
ear eer ere /1a/ hear cheer here 
ee ey /ix/ see key 
er /3:/ /13 3/ her hero butter 
ew /ux/ blew 
/1/ /ar/ is | 
ie /ix/ field 
i.e igh /at/ like night 
ir /3:/ girl 
(o) /o/ /A3U 9/ not some so button 
0.e /au/ bone 
oi oy /o1/ boil boy 
oo /ux/ /v/ too book 
oor /o1/ /vea/ door poor 
or /o:/ /3:/ for worm 
ore /ox/ before 
ou /au/ out 
ow /au/ /au/ down blow 
u /A/ /vU ur/ but put super 
u.e /ux/ rule 
ur /3:/ fur 
y /at/ /Tix/ my gym city 
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TABLE B.8: THE GRAPHEME-PHONEME CORRESPONDENCES OF BRITISH ENGLISH 
SPELLING, 5: MINOR CORRESPONDENCES FOR VOWEL GRAPHEMES. 


Grapheme(s) | Phoneme(s) Asin... 

a /e1/ any many; language sausage 

ai ay /e/ again(st) said says 

aigh /e1/ straight 

al are /ax/ half; are 

ar /a/ sugar 

au /a: o/ aunt laugh; because sausage 

augh /d:/ caught naughty 

ea ey /et/ break great steak; they 

ear /ea az 3:/ bear pear tear wear; heart; early earth heard learn 

e.e eo /ix/ these; people 

ei /at/ either 

eigh /el at/ eight; height 

eir /ea/ their 

ere /ea 3:/ there where; were 

eye /at/ eye 

ho /o/ honest honour 

hour /ava/ hour 

/2/ possible 

ie /e/ friend 

i.e /ix/ police 

ier /1a/ fierce 

ir ire /ate/ biro fire wire 

fe) /U Twa/ woman; women; once one 

0 0e 0.e /ux/ do to two who; shoe; lose move prove whose 

oa oh /au/ approach boat oh 

oar /o:/ board 

oo /A/ blood flood 

oor /o:/ door floor 

ou /o ur a/ cough; you; country couple double encourage enough 
rough tough trouble young 
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TABLE B.8: THE GRAPHEME-PHONEME CORRESPONDENCES OF BRITISH ENGLISH 
SPELLING, 5: MINOR CORRESPONDENCES FOR VOWEL GRAPHEMES, CONT. 


Grapheme(s) | Phoneme(s) Asin... 

ough /aU ur d1/ although; through; bought brought fought ought 
thought 

oul /u/ could should would 

our /a 31 uad1 colour favour honour; journey; tour; course four your; 

aue/ flour 

ower /avuea/ flower 

re /a/ centre 

u /1/ business minute 

ue ui /ux/ blue true fruit 

ure /a/ nature picture 

ye /at/ goodbye 

yre /ata/ tyre 
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